Y
Hacker News
new
|
ask
|
show
|
jobs
by
i_have_an_idea
10 days ago
Just because it is performing rather poorly by comparison, it doesn’t mean it isn’t benchmaxxed. It can still be worse than it appears.
1 comments
wasabi991011
10 days ago
It isn't benchmaxxed because they are using human preference as an evaluation.
link