Hacker News
by i_have_an_idea 10 days ago
Just because it performs rather poorly by comparison doesn’t mean it isn’t benchmaxxed. It can still be worse than it appears.
1 comment

It isn't benchmaxxed, because they are using human preference as the evaluation.