|
|
|
|
|
by versteegen
1227 days ago
|
|
> It will never learn math this way, no matter how much training you give it. Not so. Actually, (for example) the phenomenon of "grokking" is when with enough training a NN eventually experiences a phase-change from memorising data to learning the general rules underlying it [1]. Grokking isn't actually desirable, it's better that the model go more directly and quickly to learning the general rule, which is achievable in toy problems (called "comprehension" in [2]). I feel that people seem to have forgotten that deep learning is so powerful because it performs feature/representation learning, not because it can memorise, although that's powerful too. IMO that is the proper definition of 'deep learning'. [1] Power &al. Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets https://arxiv.org/abs/2201.02177 [2] Liu &al. Towards Understanding Grokking: An Effective Theory of Representation Learning https://arxiv.org/abs/2205.10343 |
|
It kinda reminds me of DeepBlue. In fact, a simple DFS has always been able to beat human in the chess, but, only in 1990s, a computer finally could beat a chess grandmaster. Reason? Because a dumb DFS is impractically slow, and the human player will die old before the computer can finish its calculation.
I believe the same goes with the current AI trend. What we have right now is rather crude. The approach itself has lots of potential, but the actual solution is yet to be found. It's really sad that people keep hyping up these partial solutions as zee AI. Whatever.