Hacker News
by BoorishBears 49 days ago
https://aibenchy.com/compare/anthropic-claude-opus-4-6-mediu...

That's not even the tip of the iceberg in how useless their benchmark is.