|
|
|
|
|
by sebzim4500
584 days ago
|
|
>Surprisingly, prediction markets [1] are putting 62% on AI achieving > 85% performance on the benchmark before 2028. Or they know the ancient technique of training on the test set. I know most of the questions are kept secret, but they are being regularly sent over the API to every LLM provider. |
|
Just letting the AI train on its own wrong output wouldn't help. The benchmark already gives them lots of time for trial and error.