- https://platform.openai.com/docs/guides/prompt-caching
- https://platform.claude.com/docs/en/build-with-claude/prompt...
- https://ai.google.dev/gemini-api/docs/caching
But then why is there compounding token usage in the article's trivial solution? Is it just a matter of using the cache correctly?
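For what it's worth, here is a rough sketch of the arithmetic behind that question. It assumes the trivial solution resends the whole conversation history on every request, which is my guess and not something I've confirmed from the article. The function name, the `cached_discount` parameter, and the token counts are all hypothetical, and the discount is a made-up knob, not any provider's actual pricing.

```python
def total_input_tokens(turn_tokens, cached_discount=0.0):
    """Billed input tokens when every request resends the full history.

    turn_tokens: new tokens added by each turn (hypothetical numbers).
    cached_discount: assumed fraction of the repeated-prefix cost that a
        cache waives (0.0 = no caching). Not a real provider rate.
    """
    total = 0.0
    history = 0  # tokens from earlier turns, resent as the prefix
    for new in turn_tokens:
        # The prefix is charged at the discounted rate, new tokens in full.
        total += history * (1.0 - cached_discount) + new
        history += new
    return total


turns = [100] * 10  # ten turns of 100 new tokens each

no_cache = total_input_tokens(turns)
with_cache = total_input_tokens(turns, cached_discount=0.9)
fresh_only = float(sum(turns))

print(no_cache, with_cache, fresh_only)
```

Under these assumptions the uncached total grows roughly quadratically with the number of turns, while only the new tokens grow linearly. A cache would shrink the cost of the repeated prefix but the prefix is still sent each time, so the growth would be discounted, not removed. The sketch also ignores model output being appended to the history.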