| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by Jensson 263 days ago

> In my experience, it's 100%. Not 95%, not 99%.

Yeah, they seem to be there on high school math problems today, there aren't that many variations on them and there are billions of examples of data on them so LLM can saturate those.

Just don't assume they are this reliable on solving real world math tasks yet, those are more varied still and stump models.

1 comments

simonw 262 days ago

They did well at the International Mathematical Olympiad this year.

link