Hacker News new | ask | show | jobs
by deadeye 21 days ago
I don't think compacting often is good for saving money. It generates more output tokens and then the input is no longer from cache, which is priced differently...typically very differently.