Hacker News
by vectorhacker 503 days ago
Yeah, I no longer consider SWE-bench useful, because these models can just "memorize" the solutions to the PRs.