Hacker News new | ask | show | jobs
by akane8 12 days ago
Empirical evidence that TDD actively hurts agent performance measured by ProgramBench, while drastically increasing cost