Hacker News new | ask | show | jobs
by ps173 437 days ago
How did you assign points to llms. I feel like we can elaborate on meterics. Beside that this is amazing
1 comments

Author here - it's based on finishing positions (so it's not winner-take-all) and then TrueSkill by Microsoft (https://trueskill.org/). It's basically a multiplayer version of Elo that's used in chess and other two-player games.