| HN Mirror



	by tlrobinson 2959 days ago
	I was curious whether anyone had tried to train an AI to play QWOP. Of course they have: https://www.youtube.com/watch?v=e27TUmMkOA0 (among others). I wonder if you could get a better result by including other factors in the reward function, such as rewarding the runner for maintaining a slight forward lean.
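	To make the idea concrete, here is a rough sketch of what a shaped reward might look like. Everything in it is an assumption for illustration: the function name, the weights, the target angle, and the premise that the agent can read the torso angle and per-step forward distance at all (the linked video may use a completely different setup).

```python
def shaped_reward(dx, torso_angle, fell=False,
                  target_lean=0.15, lean_weight=0.5, fall_penalty=10.0):
    """Hypothetical per-step reward for a QWOP-like runner.

    dx          -- forward distance gained this step (assumed observable)
    torso_angle -- torso tilt in radians, positive = leaning forward (assumed convention)
    fell        -- True if the runner fell on this step
    The default weights are illustrative guesses, not tuned values.
    """
    # Base term: reward forward progress, as a distance-only reward would.
    progress = dx
    # Shaping term: quadratic penalty for deviating from the target lean.
    lean_cost = lean_weight * (torso_angle - target_lean) ** 2
    # Terminal term: flat penalty for falling over.
    fall_cost = fall_penalty if fell else 0.0
    return progress - lean_cost - fall_cost
```

	With these made-up numbers, a step that gains the same distance scores higher when the torso is near the target lean than when it is tilted backward. Whether that actually trains a better runner is an open question; a badly weighted shaping term could just as easily teach the agent to stand still at the right angle.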