Hacker News
by
lostmsu
119 days ago
The implementation details are irrelevant to the discussion of the true cost of running the models.
mzl
119 days ago
The cost of running features like prompt caching is determined by the implementation, because the implementation is what sets the infrastructure costs.