Hacker News
user: qwesr123
created: 2025-06-13
karma: 324
marginlab.ai
submissions:
Claude Code Degraded Before Opus 4.8 Release
8 points | 0 comments
0 points | 0 comments
0 points | 0 comments
Claude Code daily benchmarks for degradation tracking
760 points | 354 comments
No one is evaluating AI coding agents in the way they are used
1 point | 0 comments
0 points | 0 comments
Claude Code Daily Degradation Tracker
3 points | 3 comments
Anatomy of a Coding Agent: A step-by-step illustration
3 points | 0 comments
How are coding assistants evaluated? SWE-Bench Pro Explorer
2 points | 0 comments
0 points | 0 comments
0 points | 0 comments
SWE-Bench: The $500B Benchmark
5 points | 0 comments
Show HN: Database of foods likely to trigger IBS
4 points | 0 comments