Y
Hacker News
new
|
ask
|
show
|
jobs
by
Our_Benefactors
37 days ago
> Meanwhile Deepseek V3’s famously frugal training was $5M
And widely derided once the team was unable to provide receipts. It’s more likely to be 10x
1 comments
gmerc
37 days ago
Why make up things? The papers are published completely and apples to apples compares 5M final training run against grok 3.5 (400M)final training run.
link
Our_Benefactors
37 days ago
Oh, it was written in a paper, must be correct then, no further investigation required just believe it at face value! No track record of academic dishonestly, and definitely no incentives to fudge the numbers.
link