Hacker News new | ask | show | jobs
by alargemoose 120 days ago
I don’t care how practical it may or may not be, this is my new favorite LLM benchmark