|
|
|
|
|
by riku_iki
515 days ago
|
|
> OpenAI to have gamed ARC-AGI by seeing the first few examples not just few examples. o3 was evaluated on "semi-private" test, which was previously already used for evaluating OAI models, so OAI had access to it already for a long time. |
|