Hacker News
by
msp26
121 days ago
Horrific comparison point. Running LLM inference locally for a single user is way more expensive than running batch inference at scale in a datacenter on actual GPUs/TPUs.
1 comment
AlexandrB
121 days ago
How is that horrific? It sets an upper bound on the cost, which turns out to be not very high.