| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by kirill5pol 1799 days ago
	Maybe true if you consider policy gradient methods and Q learning the only things that exist in RL… it’s a pretty wide field that encompasses a lot more than the stuff OpenAI puts out.