Y
Hacker News
new
|
ask
|
show
|
jobs
by
encroach
177 days ago
This is true, however LMArena does employ some methods to mitigate attempts to manipulate the leaderboard, see
https://openreview.net/forum?id=zf9zwCRKyP
They also control for style
https://news.lmarena.ai/sentiment-control/