Hacker News new | ask | show | jobs
by iamsyr 88 days ago
Coding evaluation : Claude Opus 4.6 : 47.9 GLM 5.1 : 45.3 GLM 5 : 35.4
1 comments

What benchmark is "Coding Evaluation"?