Hacker News
by amarcheschi 188 days ago
Its score on the SWE benchmark is comparable to other open models, though, and just a few points below the top-notch models.