by pavedwalden 3136 days ago
That simulation reminds me of the flash game 'QWOP'

Had the same thought. :) I wonder if anyone has done a challenge using that game, which I believe has simpler inputs?
The simplest RL algorithm (Q-learning) achieves 100 m in QWOP: https://www.youtube.com/watch?v=e27TUmMkOA0

Although it found and exploited a local maximum, the "knee scraping" technique (which humans can replicate). :)
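
For anyone unfamiliar with Q-learning, here is a minimal tabular sketch of the update rule. It is not the code from the video and does not touch QWOP: the environment is a hypothetical 1-D track I made up for illustration, and `train_q_table`, `greedy_policy`, and all the parameter values are my own assumptions.

```python
import random


def train_q_table(n_states=6, episodes=500, alpha=0.5, gamma=0.9,
                  epsilon=0.1, seed=0):
    """Tabular Q-learning on a hypothetical 1-D track (not QWOP itself).

    States are 0..n_states-1; action 0 steps left, action 1 steps right.
    Reaching the last state ends the episode with reward 1.
    """
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(n_states)]
    goal = n_states - 1
    for _ in range(episodes):
        state = 0
        while state != goal:
            # Epsilon-greedy: explore sometimes (or on ties),
            # otherwise take the best-known action.
            if rng.random() < epsilon or q[state][0] == q[state][1]:
                action = rng.randrange(2)
            else:
                action = 0 if q[state][0] > q[state][1] else 1
            next_state = max(0, state - 1) if action == 0 else state + 1
            reward = 1.0 if next_state == goal else 0.0
            # Q-learning update: nudge Q(s, a) toward
            # reward + gamma * max over a' of Q(s', a').
            target = reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


def greedy_policy(q):
    """Best-known action per state (0 = left, 1 = right)."""
    return [0 if left > right else 1 for left, right in q]


if __name__ == "__main__":
    print(greedy_policy(train_q_table()))
```

On a toy track like this there is only one way to earn reward, so the greedy policy should settle on stepping right. A game like QWOP has a much richer state space, where a greedy learner can lock onto whatever first earns reward, which is roughly how a "knee scraping" style local maximum can come about.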