|
|
|
|
|
by simonw
485 days ago
|
|
Wow, that's really surprising. My experience with much simpler RAG workflows is that once you stick a number in the context the LLMs can reliably parrot that number back out again later on. Presumably Deep Research has a bunch of weird multi-LLM-agent things going on, maybe there's something about their architecture that makes it more likely for mistakes like that to creep in? |
|
https://www.ben-evans.com/benedictevans/2025/1/the-problem-w...