Hacker News new | ask | show | jobs
by simonw 524 days ago
I really need to write that up in one place - closest I've got is this section from my 2024 review https://simonwillison.net/2024/Dec/31/llms-in-2024/#-agents-...
6 comments

>"The two main categories I see are people who think AI agents are obviously things that go and act on your behalf—the travel agent model—and people who think in terms of LLMs that have been given access to tools which they can run in a loop as part of solving a problem."

This is exactly the problem and these two categories nicely sum up the source of the confusion.

I consider myself in the former camp. The AI needs to determine my intent (book a flight) which is a classification problem, extract out the relevant information (travel date, return date, origin city, destination city, preferred airline) which is a Named Entity Recognition problem, and then call the appropriate API and pass this information as the parameters (tool usage). I'm asking the agent to perform an action on my behalf, and then it's taking my natural language and going from there. The overall workflow is deterministic, but there are elements within it that require some probabilistic reasoning.

Unfortunately, the second camp seems to be winning the day. Creating unrealistic expectations of what can be accomplished by current day LLMs running in a loop while simultaneously providing toy examples of it.

Whoa missed this! Love it.
The language problem around agents is that most companies are describing them solely from a human/UX perspective.

'You ask it to do something, and it does it'

That makes it difficult to differentiate the more critical 'how' options in the execution process. From that perspective: deterministic integrations, LLM+tools, LAM, etc are more descriptive categories, each with their own capabilities, strengths, and weaknesses.

Or to put it a different way, if the term doesn't tell you what something is good and bad at, it's probably an underspecified term.

I took a crack at it here that tries to bridge the gap from "autonomous" which is just software to that Agentic autonomy - https://www.aiimpactfrontier.com/p/framework-for-ai-agents
This write up was also fantastic and has made the rounds at our org!
this is a great write up, thank you