Hacker News new | ask | show | jobs
by nivekney 20 days ago
I think so, the benchmark is on a coding dataset (SPEED-Bench).