Y
Hacker News
new
|
ask
|
show
|
jobs
by
vectorhacker
503 days ago
Yeah, I no longer consider the SWE-bench useful because these models can just "memorize" the solutions to the PRs.