by sdenton4
1532 days ago
Hardware is expensive to iterate on, and ML research is already expensive without worrying about hardware. I expect we'll see plenty of additional attempts in robotics, regardless of what makes economic sense for OpenAI in the short run.
"No statistical model can escape this trap"
Is your claim here that intelligence requires innovation? AlphaGo certainly went beyond the bounds of its existing training data. Likewise, zero-shot learning (as we see in DALL-E 2) demonstrates the ability to combine concepts combinatorially, rather than drawing only from raw prior observation. I still wouldn't call this intelligence, but it's yet another indication of how the goalposts move in this conversation. (Never mind that at this point we typically ask for indicators that most humans could not satisfy...)
For just about any simple indicator of intelligence, there's been a concerted effort to build a neural network with that property. Most of those efforts have had a degree of success, more so over time. The 'confusion' arises because these simple indicators have repeatedly been set and then overcome.
|
I mean, AlphaZero is trained solely on self-play. It is not even given human games to learn from: it is given only the rules, and it exists in a world where it is rewarded or punished by the 'laws of physics' of the Go board, the way we exist in an environment with physical rules that constrain and reward or punish our biology.
To say that AlphaZero is just data compression of its inputs seems hand-wavy. It is data compression only in the sense that phenomena in the world are a stream of data, and the representations humans develop of that data (e.g. the laws of physics) are a compression of it.
But AlphaZero wasn't given a huge feed of pre-played world data. Rather, it interacted with and poked around in a simulated environment until it was able to make good predictions about how its interactions would turn out. I learn that a dropped ball falls to the ground, and so I can predict what happens if I drop a ball. How is AlphaZero predicting the outcome of moves purely from self-play just another kNN? And if it is, why isn't our brain's learning just a kNN too?