by ceroxylon
72 days ago
I have been thinking that scores on these SWE benchmarks will continue to improve, since these companies hire very intelligent software engineers: they can task a multitude of them with solving problems and then train the model on those answers. Data has always been the core of it all. Onward to the next abstraction, I suppose.