Hacker News new | ask | show | jobs
by johnmoberg 2243 days ago
Other than the AlphaZero papers and DQN, maybe TRPO? (assuming we're talking about deep RL)