Hacker News
by gerstep 302 days ago
Cool direction! Benchmarking agent library usage instead of pure codegen is what’s missing.