|
|
|
|
|
by grondilu
3060 days ago
|
|
> A nice win against Gnuchess Not sure why you guys don't show the PGN, but here you are: 1. e4 Nc6 2. Nf3 Nf6 3. Nc3 d5 4. exd5 Nxd5
5. Bb5 Nf4 6. O-O Bf5 7. d4 Nd5 8. Ne5 Qd6
9. Nxd5 Qxd5 10. Bxc6+ bxc6 11. c4 Qd6 12. Qf3 g6
13. Nxc6 Bg7 14. Bf4 Qe6 15. Rfe1 Qxc4 16. Rxe7+ Kf8
17. Rae1 Kg8 18. b3 Qc2 19. Qd5 Be6 20. R7xe6 fxe6
21. Qxe6+ Kf8 22. Bh6 Bxh6 23. Qf6+ Kg8 24. Ne7# Question : when you switch to self-play reinforcement learning, do you plan on starting from the networked obtained in supervised learning or tabula rasa? I understand starting from tabula rasa will require more comptuting power/time, but if you start from the supervised learning network, isn't there a risk you inherit human biases in the game style? It would also defeat the purpose of having the system discover existing chess theory and possibly new one. |
|
Should be fun to watch it learn chess theory :).