by famouswaffles
657 days ago
It's not sloppy. It just doesn't matter in the limit of training.

1. An octopus and a raven have wildly different brains. Both are intelligent. So the very idea that there is some "one true system" that the NN must discover or converge on is suspect. Even basic arithmetic has numerous methods.

2. In the limit of training on a diverse dataset (i.e. as val loss continues to go down), it will converge on the process (whatever that means) or on a process that is sufficiently robust. What gets the job done gets the job done. There is no way an increasingly competent predictor will not learn representations of the concepts in text, whether that looks like how humans do it or not.
|
So you agree with me that there is no guarantee it learns any representation of the actual process that produced the training data.