Hacker News new | ask | show | jobs
by jasondclinton 97 days ago
If you use context cacheing, it saves quite a lot on the costs/budgets. You can cache 900k tokens if you want.