Hacker News new | ask | show | jobs
by az226 238 days ago
How many times have you needed to reset the optimizer during the RL training cycles?