Hacker News
by iaml, 1988 days ago
GPT-3 is so big that it would take about 355 years to train on a single NVIDIA V100, so your example isn't really useful for comparison either. It would be interesting to see some mid-sized neural network benchmarks, though.
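For anyone wondering where a figure like 355 years can come from, here is a rough back-of-the-envelope sketch. Neither input is stated in the comment above; both are assumptions: a total training compute of about 3.14e23 FLOPs (the number usually quoted for the 175B-parameter GPT-3) and a sustained V100 throughput of about 28 TFLOPS. The function name `training_years` is made up for illustration.

```python
# Back-of-the-envelope estimate of single-GPU training time.
# Both constants below are assumptions, not figures from the comment:
#   - TOTAL_TRAIN_FLOPS: commonly quoted total compute for GPT-3 175B
#   - V100_FLOPS_PER_SEC: assumed sustained mixed-precision throughput

TOTAL_TRAIN_FLOPS = 3.14e23      # assumed total floating-point operations
V100_FLOPS_PER_SEC = 28e12       # assumed 28 TFLOPS sustained
SECONDS_PER_YEAR = 365 * 24 * 3600


def training_years(total_flops: float, flops_per_second: float) -> float:
    """Return the wall-clock years needed on one device at a constant rate."""
    seconds = total_flops / flops_per_second
    return seconds / SECONDS_PER_YEAR


if __name__ == "__main__":
    years = training_years(TOTAL_TRAIN_FLOPS, V100_FLOPS_PER_SEC)
    print(f"Estimated single-V100 training time: {years:.0f} years")
```

The result is very sensitive to the throughput assumption: a real V100 rarely sustains its peak rate, so the actual number would likely be higher.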