Hacker News new | ask | show | jobs
by qPM9l3XJrF 1753 days ago
I think you'd be better off using a Gaussian process than reinforcement learning