|
|
|
|
|
by famouswaffles
561 days ago
|
|
Changing numerical values doesn't do anything to impact the performance of state of the art models (4o, o1-mini, preview) The only thing that does is the benchmark that introduces "seemingly relevant but ultimately irrelevant information" |
|