Y
Hacker News
new
|
ask
|
show
|
jobs
by
MattRix
48 days ago
The open models of similar scales (ex. the new 1T deepseek model) are a fraction of the cost per token, so I don’t see how that can be the case. Inference is profitable, it’s the training that makes it unprofitable.