by hereonout2 486 days ago
Didn't the DeepSeek paper itself state they trained on 2048 H800s?
So claiming they have access to 5x that amount isn't such a bold claim, is it?