Hacker News new | ask | show | jobs
by 7777777phil 57 days ago
50% cache savings is nice buuuuut.. token cost is the product now, not the model. Every coding agent is converging on the same trick, cache the context across calls. Anthropic's prompt caching, then Cursor's snapshots, now this Reddit thing.