Hacker News
by angelpan 27 days ago
Curious how much of the code progress story is post-training investment vs. code being uniquely suited to RL with verifiable rewards.