| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by Stoids 607 days ago
	We aren’t good at creating software systems from reliable and knowable components. A bit skeptical that the future of software is making a Rube Goldberg machine of black box inter-LLM communication.

5 comments

mmcdermott 607 days ago

I could see software having a future as a Rube Goldberg machine of black box AIs, if hardware is cheap enough and the AIs are good enough. There was a scifi novel (maybe "A Fire Upon the Deep"?) where there was no need to write software because AI could cobble any needed solution together by using existing software and gluing it together. Throwing cycles at deepening layers was also something that Paul Graham talked about in the hundred year language (https://paulgraham.com/hundred.html).

Now, whether hardware is cheap enough or AI is smart enough is an entirely different question...

link

dsv3099i 607 days ago

As someone who makes HW for a living, please do make more Rube Goldberg machines of black box LLMs. At least for a few more years until my kids are out of college. :)

link

paulddraper 607 days ago

Not only is it the future of software, it's the past and present as well.

link

keeganpoppen 607 days ago

“we arent good at it, so we shouldnt admit the possibility that a fundamentally different entity and architecture might be better suited”

link

TZubiri 607 days ago

I'm pretty sure this is a satire post

link

pizza 607 days ago

Here's a practical in this vein but much simpler - if you're trying to answer a question with an LLM, and have it answer in json format within the same prompt, for many models the accuracy is worse than just having it answer in plaintext. The reason is that you're now having to place a bet that the distribution of json strings it's seen before meshes nicely with the distribution of answers to that question.

So one remedy is to have it just answer in plaintext, and then use a second, more specialized model that's specifically trained to turn plaintext into json. Whether this chain of models works better than just having one model all depends on the distribution match penalties accrued along the chain in between.

link

TZubiri 606 days ago

I wrap the plaintext in quotes, and perhaps a period, so that it knows when to start and when to stop, you can add logit biases for the syntax and pass period as a stop marker to chatgpt apis.

Also you don't need to use a model to build a json from plaintext answers lol, just use a programming language.

link

from-nibly 607 days ago

So developing solutions with ai is like trying to build stuff with family feud.

link