|
|
|
|
|
by fourpostmaun2
479 days ago
|
|
The entire premise of this paper is false. They claim that the "hints_text" is used and leaks the answer in Section 2.1.1; however, the authors of SWE-Bench themselves state that this is not used anywhere (Issue #133 on the official SWE-Bench GitHub). According to the paper: > 1. Solution leak: represents instances where the solution to the issue is clearly outlined in the issue
description or comments on GitHub. Since both the issue descriptions and comments (referred to
as hints_text in the SWE-Bench study) are provided as input to the models, these LLM models can
extract the solutions directly from this information instead of generating it independently. And yet, the SWE-Bench authors themselves explicitly state: > In short, for participating on the SWE-bench leaderboard, using hints_text in any manner is not allowed. Although we don't explicitly say this in the original paper, we also do not make any mention of using the hints_text anywhere. So, it's a made up issue that would only occur if you deviated from the paper implementation and explicitly added a field called "hints" that isn't used anywhere. |
|
[1] Don't ask me why they cited the issue number, 16669, instead of the pull request number, 16766, when only the latter appears in the dataset. This confused me for a bit.