Hacker News new | ask | show | jobs
by noddybear 467 days ago
True - although it might be interesting to benchmark them both, as (1) is more about debugging (something that these agents spend a lot of time doing).