Hacker News new | ask | show | jobs
by 0xbadcafebee 19 days ago
The cheaper ones are fp4 and fp8 whereas I assume DeepSeek provider is unquantized, so that probably accounts for it. DeepSeek also doesn't necessarily have the cheapest hardware, other providers could be using it as a loss leader, etc
1 comments

I belive no sane provider, antropic and openai included, serve BF16.

Side note: I suspect Antropic was experimenting with changing quant level based on server load a few months back which is what caused that major quality drop we saw then.