Hacker News new | ask | show | jobs
user: qwesr123
created: 2025-06-13
karma: 324

marginlab.ai

submissions:

Claude Code Degraded Before Opus 4.8 Release
8 points | 0 comments
0 points | 0 comments
0 points | 0 comments
Claude Code daily benchmarks for degradation tracking
760 points | 354 comments
No one is evaluating AI coding agents in the way they are used
1 points | 0 comments
0 points | 0 comments
Claude Code Daily Degradation Tracker
3 points | 3 comments
Anatomy of a Coding Agent: A step-by-step illustration
3 points | 0 comments
How are coding assistants evaluated? SWE-Bench Pro Explorer
2 points | 0 comments
0 points | 0 comments
0 points | 0 comments
SWE-Bench: The $500B Benchmark
5 points | 0 comments
Show HN: Database of foods likely to trigger IBS
4 points | 0 comments