| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by iamnotagenius 381 days ago
	With all due respect to Deepseek, I would take their numbers with grain of salt, as they might as well be politically motivated.

3 comments

jarym 381 days ago

Any more politically motivated than a model from anywhere else?

link

pama 381 days ago

The current version of sglang allows inference with the R1 model at a cost that is very close to the rate that DeekSeep claimed (using H100s, not exactly the DeepSeek compute). Their claim is almost validated by replication at this point so there is nothing left to take with a grain of salt other than the possibility that there exists potentially an even higher margin than what they claimed if one were to optimize for modern NVidia hardware.

link

WithinReason 381 days ago

is that better or worse than commercially motivated?

link

leeoniya 381 days ago

commercial motivatation needs to show eventual profit to be sustainable, while political does not.

though at the outset (pre-profit / private) it's hard to say there's much difference.

link

bee_rider 381 days ago

> though at the outset (pre-profit / private) it's hard to say there's much difference.

I think this is the tough part, we’re at the outset still.

Also, a political investment could could be sustainable, in the sense that China might decide they are fine running Deepseek at a loss indefinitely, if that’s what’s going on (hypothetically. Actually I have never seen any evidence to suggest Deepseek is subsidized, although I haven’t gone looking).

link

lazide 381 days ago

Also, solar panel dumping as a quite successful example (on many, many fronts).

link