Hacker News new | ask | show | jobs
by idiotsecant 432 days ago
Aren't we already emulating it? It's sort of a distributed and overlaid reward function, which we just undistributed