Hacker News
by sporkland 36 days ago
Is there any current research on whether, as agents w/tools start dominating LLM use, the better path forward is making models smaller and less single-shot, more like efficient engines that can process a lot of context, and feeding a lot more into their context windows, vs trying to memorize the world?

Like smaller models that show effectiveness on problems with verifiable rewards when run in a loop with external grounding context?