by rzmmm 2 days ago
The ranking is not comparable across time like that.
1 comment

I'm using the models' current Elo ratings, and both are still running in the arena.