|
|
|
|
|
by obblekk
201 days ago
|
|
80% on swebench verified is incredible. a year ago the best model was at ~30%. i wonder if we'll soon have a convincingly superhuman coding capability (even in a narrow field like kernel optimization). this is the most interesting time for software tools since compilers and static typechecking was invented. |
|