But then why is there compounding token usage in the article's trivial solution? Is it just a matter of using the cache correctly?