Hacker News new | ask | show | jobs
by oezi 310 days ago
I meant scaling the base training before RL.