by lostmsu 119 days ago
The implementation details are irrelevant to the discussion of the true cost of running the models.

The cost of running features like prompt caching is determined by the implementation, since the implementation is what sets the infrastructure costs.