Hacker News new | ask | show | jobs
by thomashop 405 days ago
I'm pretty sure the model is cached with the system prompt already processed. So you should only pay extra tokens.