Hacker News new | ask | show | jobs
by StevenWaterman 148 days ago
Cached tokens are cheaper (90% discount ish) but not free
1 comments

Also, unlike OpenAI, Anthropic's prompt caching is explicit (you set up to 4 cache "breakpoints"), meaning if you don't implement caching then you don't benefit from it.
thats a very generous way of putting it. Anthropic's prompt caching is actively hostile and very difficult to implement properly.