Hacker News new | ask | show | jobs
by svalorzen 1741 days ago
I maintain a repository of many implementations of classical (tabular) RL algorithms [1] which you might enjoy playing with when starting out. I use it for both research and for student projects. The advantage of avoiding NNs when starting out is that it is much simpler to inspect the inner workings of an algorithm to see whether it's working or not.

I'm always happy to help if something is unclear or difficult so feel free to open issues there :)

[1]: https://github.com/Svalorzen/AI-Toolbox