Hacker News new | ask | show | jobs
by Eliezer 656 days ago
Q* is also a term from reinforcement learning.