by qwesr123, 144 days ago
FYI, the MarginLab Claude Code degradation tracker is showing a statistically significant ~4% drop in SWE-Bench-Pro accuracy over the past month.