by Our_Benefactors 37 days ago
> Meanwhile Deepseek V3’s famously frugal training was $5M

And widely derided once the team was unable to provide receipts. It’s more likely to be 10x that.

1 comments

Why make things up? The papers are published in full, and the apples-to-apples comparison is the $5M final training run against the Grok 3.5 ($400M) final training run.
Oh, it was written in a paper, so it must be correct then. No further investigation required, just believe it at face value! No track record of academic dishonesty, and definitely no incentives to fudge the numbers.