Hacker News new | ask | show | jobs
by ronsor 103 days ago
I assure you datacenter GPUs like B200 do fail regularly (within months in many cases), so much so that it's a problem for labs doing large training runs.