OpenAI funded independent math benchmark before setting record with o3

Y	Hacker News new \| ask \| show \| jobs

	OpenAI funded independent math benchmark before setting record with o3 (the-decoder.com)
	56 points by rar00 513 days ago

5 comments

andrepd 513 days ago

> They also made a verbal agreement with OpenAI that prohibits the company from using the materials to train their models

Hilarious.

link

aithrowawaycomm 513 days ago

Elliot Glazer seems to have been caught in a contradiction: https://xcancel.com/ElliotGlazer/status/1880809468616950187

Here he says that Epoch is "developing" a private test set that OpenAI doesn't have access to, but elsewhere Epoch strongly implied that this already existed. This kind of makes me lean towards "Epoch AI lied" instead of "Epoch AI got played." (Even the coauthors weren't informed about the funding, so Epoch does not deserve a presumption of good faith.)

I guess the real question: o3 was able to solve 25% of Frontier problems, so were these the problems whose solutions OpenAI had access to? If so, then that score is meaningless and dishonest.

link

ChrisArchitect 513 days ago

[dupe] Discussion on source:

https://news.ycombinator.com/item?id=42763231

link

nioj 513 days ago

link

Frederation 513 days ago

Cant trust anyone, ever.

link

plsbenice34 512 days ago

I think it's more useful to assume everyone else is less blatantly deceptive than OpenAI, at least, so there's some hierarchy of trustworthiness to help lead you to the truth.

link