|
|
|
|
|
by Veedrac
1482 days ago
|
|
That quote is disingenuous. Do people really think that... * Jeff Dean, lead of Google's AI division, wrote a paper with all that complexity to get SOTA on CIFAR-10? * Jeff Dean, whose salary is sometimes estimated as $3m/y and is responsible for the direction of research of many more, is unreasonable for using <$60k of compute at public pricing, and less than that at internal pricing? * going from a 0.6% error rate to a 0.57% error rate is reasonably summarized as ‘a 0.03% improvement’, ignoring both that it's a 5% reduction in error and that such improvements get harder as you approach (or exceed) the label accuracy of the dataset? * the accuracy from this paper came purely from scale? |
|
Re-training SotA with a different random seed may make its score 0.03% difference. Or there was a wrong calculation in 17,810 TPU core-hours due to faulty hardware or cosmic ray hit which cause the final produce model 0.03% difference.