Y
Hacker News
new
|
ask
|
show
|
jobs
by
dragonwriter
444 days ago
> > [train]: "The training runs on 64 A100 GPUs over nine days",
> How is that a "year and half of GPU time".
64 GPUs × 9 days = 576 GPU-days ≈ 1.577 GPU-years
1 comments
refulgentis
444 days ago
Doh, that's entirely fair: haven't been in this thread yet, but would echo what I perceive as implicit puzzlement re: this amount of GPU time being described as bitter-lesson-y.
link