|
|
|
|
|
by tartakovsky
124 days ago
|
|
Well, task == Resolving real GitHub Issues Languages == Python only Libraries (um looks like other LLM generated libraries -- I mean definitely not pure human: like Ragas, FastMCP, etc) So seems like a highly skewed sample and who knows what can / can't be generalized. Does make for a compelling research paper though! |
|