Hacker News new | ask | show | jobs
by ilaksh 340 days ago
Yes, prompt caching helps a lot with the cost. It still adds up if you have some tool outputs with long text. I have found that breaking those out into subtasks makes the overall cost much more reasonable.