Hacker News
by purple-leafy 1 hour ago
Benchmarks like this are onto something. This is the next frontier of LLM benchmarking.