|
|
|
|
|
by npinsker
196 days ago
|
|
Completely false. This is like saying being good at chess is equivalent to being smart. Look no farther than the hodgepodge of independent teams running cheaper models (and no doubt thousands of their own puzzles, many of which surely overlap with the private set) that somehow keep up with SotA, to see how impactful proper practice can be. The benchmark isn’t particularly strong against gaming, especially with private data. |
|