|
|
|
|
|
by ArtWomb
2202 days ago
|
|
Humans observe an object once, such as a cup for drinking water, and we immediately grasp its "cupness". We can identify infinite varieties of cups despite variations in morphology, design, utility and context. Simply based on a single learning instance. This absence of any neural theory of inference is at the crux of the problem ;) Shortcut Learning in Deep Neural Networks https://arxiv.org/abs/2004.07780 |
|
Adults do (i.e. the agents pretrained holistic model of its entire observed physical context). By reducing the phenomenon to the single observation, you're conveniently ignoring the early childhood phases spent exploring shapes/3d-geometry that enable this very ability of inference. this isn't fair, because regarding humans, the line between training-phase and trained model is very blurry, whereas a statistical model is trained when the weights are set and done.
Brute forcing through 2d-projections of 3d-objects (further denormalized through camera-artifacts etc.) until something sticks in a convoluted (heh) composition of arbitrarily initialized set of nodes and connections is obviously far different from the physical exploration kids do. Comparing the models resulting from the latter with the former is, in a word, absurd.
Through exploration, humans develop a model of physics itself, from which the nature of cupness can be inferred (which is, in fact, the magic term).
Deep learning alone won't get us there, but it'll probably give us the components that enable us to simulate this intricate process happening in kids brains.
In fact, I'm pretty sure that that's what a lot of the smart people researching general intelligence are working on (because that's what I would do, excuse my hybris).