|
|
|
|
|
by ScottAaronson
2915 days ago
|
|
On further thought, we should distinguish two problems: (1) "Induce" the rules, in the sense that after training on millions of played games, you now have a neural net or whatever that plays mostly or entirely according to the rules even though it can't articulate them. (2) Output an explicit description of the rules. I could easily believe that (2) is beyond the current abilities of AI, even if it turns out that (1) is doable. |
|
To be honest, I don't think there's much work on inducing the rules of chess, in particular. It's probably considered a) easy enough to do by hand and b) too hard to machine-learn.
>> On further thought, we should distinguish two problems:
(1) - Yep. The most likely approach would be a classifier trained to label moves as legal/ illegal. The resulting model would be a vector of numerical parameters so not a traditional rule base. It would also only be correct within some margin of error, probably not 0, limiting its uses (e.g. it wouldn't make sense to train it to play and then pit it against a player with a correct rulebase; they wouldn't be playing the same game).
>> I could easily believe that (2) is beyond the current abilities of AI, even if it turns out that (1) is doable.
Also yes. Exactly on point in fact.
When we're talking about learning rules, we 're talking about learning automata, the subject of inductive inference, an older branch of machine learning (well, ish) that fell out of favour after a bunch of theoretical results showed it was basically impossible to learn any interesting class of automata from examples (the most famous is Mark E. Gold's result from Language Identification in the Limit; only finite languages can be learned from finite examples, anything else is only learnable "in the limit", from infinitely many examples, or an all-knowning oracle, etc).
Modern machine learning starts with Valiant's A Theory of the Learnable, which introduced PAC learning and a relaxation in the assumptions of inductive inference, about what should (and, therefore, can) be learned.
In short, the difference is that inductive inference attempted to learn complete definitions of various automata, whereas modern machine learning attempts to approximate them; well, strictly speaking it's about approximating functions, not automata as such.
So yeah, pretty much, like you say: (2) is, in principle, not possible whereas (1) might even be possible in practice.
Now, normally this is where I'd plug my own research, but I've already written plenty :)