Hacker News new | ask | show | jobs
by coffeeri 139 days ago
There is https://artificialanalysis.ai
2 comments

There are many lists, but I find all of them outdated or containing wrong information or missing the actual benchmarks I'm looking for.

I was thinking, that maybe it's better to make my own benchmarks with the questions/things I'm interested in, and whenever a new model comes out run those tests with that model using open-router.

Thank you! Exactly what I was looking for