Y
Hacker News
new
|
ask
|
show
|
jobs
by
wmf
21 days ago
At least there shouldn't be any complaints about benchmaxing this time.
1 comments
i_have_an_idea
21 days ago
Just because it is performing rather poorly by comparison, it doesn’t mean it isn’t benchmaxxed. It can still be worse than it appears.
link
wasabi991011
21 days ago
It isn't benchmaxxed because they are using human preference as an evaluation.
link