Hacker News new | ask | show | jobs
by wmf 21 days ago
At least there shouldn't be any complaints about benchmaxing this time.
1 comments

Just because it is performing rather poorly by comparison, it doesn’t mean it isn’t benchmaxxed. It can still be worse than it appears.
It isn't benchmaxxed because they are using human preference as an evaluation.