Y
Hacker News
new
|
ask
|
show
|
jobs
by
grandinquistor
64 days ago
looking at the system card for opus 4.7 the MCRC benchmark used for long context tasks dropped significantly from 78% to 32%
I wonder what caused such a large regression in this benchmark