Hacker News new | ask | show | jobs
by woopwoop 510 days ago
They aren't proofs, they're just numbers. All the questions have numerical answers. That's how they're evaluated.
1 comments

I think those reasoning models are smart enough to not emit memorized answer if they can't come with CoT proof.

But OAI could draw any result, no one was checking, they probably were not brave enough to declare math as solved topic.