| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by chatmasta 619 days ago
	Token costs are not zero when you’re running local models, because you paid for the hardware, and you can’t scale inference indefinitely without paying for more hardware.