by dangoodmanUT 76 days ago
Because the model labs can RL against their own scaffold, which will be borderline unbeatable.