| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by MattRix 48 days ago
	The open models of similar scales (ex. the new 1T deepseek model) are a fraction of the cost per token, so I don’t see how that can be the case. Inference is profitable, it’s the training that makes it unprofitable.