| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by msp26 121 days ago
	Horrific comparison point. LLM inference is way more expensive locally for single users than running batch inference at scale in a datacenter on actual GPUs/TPUs.

1 comments

How is that horrific? It sets an upper bound on the cost, which turns out to be not very high.