Hacker News new | ask | show | jobs
by kirill5pol 1799 days ago
Maybe true if you consider policy gradient methods and Q learning the only things that exist in RL… it’s a pretty wide field that encompasses a lot more than the stuff OpenAI puts out.