Y
Hacker News
new
|
ask
|
show
|
jobs
by
dangoodmanUT
76 days ago
Because the model labs can RL against their own scaffold, which will be borderline unbeatable