Hacker News
by bevekspldnw 26 days ago
How much of this is RL’ing a good coding model on every CVE ever?
1 comment

most of this comes from the pretrain imo. just scale + some RL = mythos