by Someone
2807 days ago
“When you increase the number of parameters (weights) in an NN by a factor of 5 you don’t just get 5 times the capacity and need 5 times as much training data. In terms of expressive capacity increase it’s more akin to a number with 5 times as many digits. So if V8’s expressive capacity was 10, V9’s capacity is more like 100,000.”

I find that very, very hard to believe. I know ‘expressive power’ is a fairly vague concept, but if things scale that well, there must be papers out there that at least hint at such (IMHO) insane scaling laws.

It would also have to mean that it is fairly easy for those with huge budgets to build a system that’s way better, except that it is too slow or takes too much power (just as 3D graphics in movies show what will be on our desks/phones in a decade or two).

I’ve asked this before, but does anybody know of papers that describe an offline self-driving system that’s as good as perfect?