by caycep 1399 days ago
For some reason this reminds me of that whole class of deep reinforcement learning / "policy" algorithms