Hacker News
by samusiam, 78 days ago
These OSS model makers need to stop benchmarking against old models. Showing how a model performs against Opus 4.5 and GLM-5 when we already have Opus 4.6 and GLM-5.1 just tells me that it's not comparable to SOTA.