by _jayhack_
1148 days ago
This logic only applies to generative pre-training, behavior cloning, and other training methods which rely on learning to mimic well-structured content from the real world. It does not apply to intelligence gathered through methods like RL. How does the author think about the intelligence of AlphaGo, for instance, which was trained entirely by self-play?
As for AlphaGo, I would put it in the same category of intelligence as a calculator. It does one thing well: approximate a Monte Carlo Tree Search.