The thing is, the same agent that made the bananas mistake is also quite good at catching that mistake (if called again with fresh context). This results in convergence on working, non-bananas solutions.
Look up The Old Lady who Swallowed a Fly. Or The King, the Mice and his Cheese
What you propose makes things worse, not better
LLMs are magnificent tools, but there needs to be a human hand holding them.
Nothing I have seen anywhere, yet, challenges my view that "agents" will not be a good idea until we have better technology, that there is no sign of yet (?), than LLMs.
What you propose makes things worse, not better
LLMs are magnificent tools, but there needs to be a human hand holding them.
Nothing I have seen anywhere, yet, challenges my view that "agents" will not be a good idea until we have better technology, that there is no sign of yet (?), than LLMs.