by wongarsu 77 days ago
However, output tokens are 5-10 times more expensive, so it ends up a lot more even on price.
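To put rough numbers on that, here is a quick sketch. Only the 5-10x multiplier comes from the comment; the 10:1 input-to-output token ratio and the unit input price are made-up values for illustration, and `output_cost_share` is just a hypothetical helper name.

```python
def output_cost_share(input_tokens, output_tokens, output_multiplier, input_price=1.0):
    """Fraction of total spend that goes to output tokens."""
    input_cost = input_tokens * input_price
    # Output tokens are billed at some multiple of the input price.
    output_cost = output_tokens * input_price * output_multiplier
    return output_cost / (input_cost + output_cost)


# Hypothetical workload: 10 input tokens for every output token.
for multiplier in (1, 5, 10):
    share = output_cost_share(10, 1, multiplier)
    print(f"{multiplier}x output price -> output is {share:.1%} of total cost")
```

Under those assumptions the token count is lopsided 10:1, but at a 10x price multiplier the spend splits evenly between input and output.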

Even more than that in practice, once you factor in prompt caching.
I think we still skew back toward an insanely high input-token ratio once you consider agentic loops. For example, when the tools I use do a web fetch, a search, or some other tool call, each one adds an incredibly high number of new input tokens.
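A back-of-the-envelope sketch of why agentic loops skew this way, assuming the client resends the whole conversation every turn and each turn appends one tool result. Every number here (prompt size, tool result size, reply length, the 5x output multiplier, the 0.1 cached-input discount) is a placeholder I picked, not any provider's real pricing, and both function names are hypothetical.

```python
def agent_loop_tokens(turns, system_prompt=2_000, tool_result=5_000, reply=300):
    """Tally tokens for a loop that resends the full context on every turn."""
    cached = 0           # prefix already sent in an earlier turn
    new = system_prompt  # tokens the model has not seen yet
    totals = {"new_input": 0, "cached_input": 0, "output": 0}
    for _ in range(turns):
        totals["new_input"] += new
        totals["cached_input"] += cached
        totals["output"] += reply
        # This turn's new tokens join the reusable prefix; the next turn
        # adds the model's reply plus the tool result it asked for.
        cached += new
        new = reply + tool_result
    return totals


def relative_cost(totals, output_multiplier=5.0, cached_discount=0.1):
    """Cost in units of one uncached input token (assumed pricing)."""
    return {
        "new_input": totals["new_input"] * 1.0,
        "cached_input": totals["cached_input"] * cached_discount,
        "output": totals["output"] * output_multiplier,
    }


totals = agent_loop_tokens(turns=10)
input_total = totals["new_input"] + totals["cached_input"]
print(totals)
print(f"input:output token ratio is about {input_total / totals['output']:.0f}:1")
print(relative_cost(totals))
```

With these placeholder sizes, input tokens outnumber output tokens by a wide margin even though each reply is short, and the caching discount mostly changes how the input side is billed rather than how many tokens flow in.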