by goosejuice
507 days ago
Benchmarks are great to have, but individual and org experiences on specific codebases still matter tremendously. If an org consistently finds that one model performs worse than another on its own corpus, it isn't going to keep using that model just because it ranks higher on some set of benchmarks.