Hacker News
by anish_m 994 days ago
What are the SOTA benchmarks for LLMs now? Love the progress on open-source models, but I'd like to see an uncontaminated, objective framework for evaluating them.