Hacker News
by Jensson 263 days ago
> In my experience, it's 100%. Not 95%, not 99%.

Yeah, they seem to be there on high school math problems today. There aren't that many variations on them, and there are billions of examples of them in the data, so LLMs can saturate those.

Just don't assume they are this reliable at solving real-world math tasks yet. Those are still more varied and stump models.

1 comment

They did well at the International Mathematical Olympiad this year.