Hacker News
by Refusing23 492 days ago
Benchmarks don't show the quality or 'correctness' of the response, though.
1 comment

What do they show?
If you mean this particular benchmark, it shows how much people like the responses an LLM gives.