Hacker News new | ask | show | jobs
by prmph 807 days ago
Then how can you have confidence that it actually fixes the bug? It means you still need a human to review the fix, no?
1 comments

The developer written testcases are provided in SWE-bench-lite so those could be used to check the generated patches.

The auto-generated patches are to reduce the effort of resolving issues. In practice, they should be reviewed and verified by human developers before they are integrated.