Hacker News new | ask | show | jobs
by dartos 638 days ago
Tbf RL is pretty incredible.

I trained a model to play a novel video game using only screenshots and a score using RL and I discovered how not to lose