|
|
|
|
|
by jks
442 days ago
|
|
You probably mean the USAMO 2025 paper. They updated their comparison with Gemini 2.5 Pro, which did get a nontrivial score. That Gemini version was released five days after USAMO, so while it's not entirely impossible for the data to be in its training set, it would seem kind of unlikely. https://x.com/mbalunovic/status/1907436704790651166 |
|