Hacker News new | ask | show | jobs
by JonathanChavez 571 days ago
Hey HN, I built this site to compare provider metrics and benchmark results between models.

All the data is open and with references. Here: https://github.com/JonathanChavezTamales/LLMStats

I hope this is useful. It was created using Sonnet 3.5 + o1 + Cursor.

Let me know if you have any feedback! Thanks.

PS:

It's hard to compare providers' quality because they use different precision at inference. Also, some labs cherry pick the benchmarks they want to report for their models. Medium term goal is to run the evals myself.