| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by ivanvoid 94 days ago
	Obviously I familiar with RL, written multiple training pipelines in my day. and in order to gain that “super human skill” using RL you need to define fit functions and provide environments that will provide you with feedback that used for training. Go and chess are have clear rules and environment that provide you with a signal of success, I waiting to see this for coding, I don’t say it’s impossible just orders of magnitude harder