|
|
|
|
|
by tshadley
1236 days ago
|
|
> An NN is simply an approximation of a multi-valued function, whose parameters are adjusted by minimizing the difference between the output of the NN and the output of the real function for a certain input. Right, but that equally fits a biological NN if you zoom in that close. You'll need more than wikipedia to appreciate what deep-neural-networks are doing here, it's dimensional space that's key. What DNNs do that is similar to the human brain is that they order "concepts" in high-dimensional space. Colors, textures, shape and hierarchies of same are organized and cross-referenced with text in an incredibly complex connectome. It would be useless to memorize images with their textual descriptions as that would be horrendously inefficient/ineffective during inference. Rather, the model must do what we do and understand what makes an image a "landscape" or a "portrait" or a "cartoon". It needs to understand what is an artist's style and how to perform it on a work never before created. "Understanding" can only mean ordering meaningless letters and pixels in multidimensional space so that they line up with human understanding (and human 'understanding', in turn, can only mean ordering meaningless sensory perceptions in the brain's multidimensional connectome such that reality turns out to be approximately predicted and controlled). The only systems that work this way efficiently are neural networks, biological and artificial. |
|
First of all, we have no idea how biological NNs learn, how they represent information, how they reason etc. Given what we do know, there is no reason to assume any similarity with ANNs on any of those fronts. Just to give one example, we know very well that a single biological neuron encodes significant information and is capable of reasoning on its own. In fact, even non-neuronal biological cells are capable of such - especially looking at single-celled organisms, which display extraordinarily complex behaviors with no NN in sight.
Second of all, we don't exactly understand how the huge models we have actually encode the higher-level representations of the training set that they store. Of course, we can say for sure that they are not literally storing a copy of the data on simple space requirements. But we can also say for sure that their "understanding" of the data, as well as their capacity for inference, is significantly different from our own - since they make certain mistakes that are nearly impossible for a human to make, while showing super human abilities in other aspects. So, if anything, we must conclude that whatever it is they are doing, it is most certainly not a way of understanding the information the way we understand it.