by weird-eye-issue 79 days ago
Even more than that in practice, once you factor in prompt caching.
1 comment

I think we still skew back to an insanely high input-to-output token ratio once you consider agentic loops. For example, when the tools I use do a web fetch, a search, or some other tool call, the result comes back as an incredibly high number of new input tokens.
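
To put rough numbers on that, here is a back-of-the-envelope sketch in Python. Everything in it is an assumption for illustration: the function name `loop_token_totals`, the token counts, and the simplification that every step re-sends the whole context as input and appends one tool result. It ignores prompt caching, which changes what those input tokens cost but not how many there are.

```python
def loop_token_totals(system_tokens, tool_result_tokens, output_tokens_per_step, steps):
    """Return (total_input, total_output) tokens for a simple agentic loop.

    Assumptions (hypothetical, not measured from any real tool):
    - every step sends the entire context so far as input
    - every step appends the model's output plus one tool result to the context
    """
    context = system_tokens
    total_input = 0
    total_output = 0
    for _ in range(steps):
        total_input += context                  # whole context counted as input again
        total_output += output_tokens_per_step  # short model reply / tool call
        context += output_tokens_per_step + tool_result_tokens
    return total_input, total_output


# Made-up example: 2k-token prompt, 5k-token tool results, 200-token replies, 10 steps
total_in, total_out = loop_token_totals(2000, 5000, 200, 10)
print(total_in, total_out, total_in / total_out)  # 254000 2000 127.0
```

With these made-up inputs the loop lands at roughly 127 input tokens for every output token, and the ratio keeps growing with the number of steps because each tool result gets counted again on every later turn.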