Hacker News
by
zamadatix
322 days ago
"+100 points" sounds like a lot until you do the Elo math and see that it means roughly 1 out of 3 people still preferred Claude Opus 4's response. Remember, 1 out of 2 would place the models dead even.
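
For anyone who wants to check the arithmetic, here is a minimal sketch. It assumes the leaderboard uses the standard logistic Elo formula with a 400-point scale and ignores ties; the function name `elo_expected_score` is just an illustrative helper, not anything from the leaderboard itself.

```python
def elo_expected_score(rating_gap: float) -> float:
    """Expected score for the higher-rated side under standard Elo.

    Assumes the usual logistic curve with a 400-point scale; with no ties,
    this is the share of head-to-head comparisons the higher-rated model wins.
    """
    return 1.0 / (1.0 + 10.0 ** (-rating_gap / 400.0))


gap = 100  # the "+100 points" from the comment
p_higher = elo_expected_score(gap)
p_lower = 1.0 - p_higher  # share still preferring the lower-rated model

print(f"higher-rated preferred: {p_higher:.3f}")  # 0.640
print(f"lower-rated preferred:  {p_lower:.3f}")   # 0.360
```

A gap of 0 gives 0.5, the dead-even case, and a 100-point gap leaves about 36% of votes going to the lower-rated model, which is where the "1 out of 3" comes from.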