Hacker News
by burkaman 140 days ago
It's actually 80% against Opus, 66% average against the 5 models it's tested with.