by Alifatisk 498 days ago
Is there a good benchmark one can look at that shows the best-performing LLM in terms of instruction following or overall score?

The only ones I am aware of are benchmarks on Twitter, Chatbot Arena [1], and the Aider benchmark [2].

1. https://huggingface.co/spaces/lmarena-ai/chatbot-arena-leade...

2. https://aider.chat/docs/leaderboards