Hacker News new | ask | show | jobs
by sirnicolaz 63 days ago
Consider that SWE benchmarking is mainly done with python code. It tells something