Y
Hacker News
new
|
ask
|
show
|
jobs
by
aszen
167 days ago
Agreed it probably contributes to the model improving for all agents but crucially it is verifiably better against their own agent. So they get a good feedback loop to improve both