Hacker News
by vicentwu 505 days ago
RL doesn't need that much static data; it needs a lot of "good" tasks/challenges and computation.