Hacker News new | ask | show | jobs
by yorwba 254 days ago
You'll first need to frame Towers of Hanoi as a supervised learning problem. I suspect the answer to your question will differ depending on what you pick as the input-output pairs to train the model on.