Hacker News new | ask | show | jobs
by 0x5FC3 32 days ago
Interesting. OpenAI could also be trying to solve other problems, but Erdos problems maybe falling first?
1 comments

No, Erdos problems were accepted as sort of a benchmark. There's a bunch of reasons they're favorable for this task:

1. They have a wide range of difficulties. 2. They were curated (Erdos didn't know at first glance how to solve them). 3. Humans already took the time to organize, formally state, add metadata to them. 4. There's a lot of them.

If you go around looking for a mathematics benchmark it's hard to do better than that.

I'm curious how the relative difficulty between the problems can be assessed when no one knows how to solve any of them.