Hacker News new | ask | show | jobs
by cubefox 636 days ago
They are already above average human level on many tasks, like math benchmarks.
4 comments

They really aren't better than humans at math or logic, they are good at the benchmarks because they are hyper optimized for the benchmarks lol. But if you ask LLMs simple logical questions they still get them wrong all the time
Yes, there are certain tasks they're great at, just as AI has been superhuman in some tasks for decades.
But now they are good or even great at way more tasks than before because they can understand and use natural languages like English.
Yeah, and they're still under delivering to their hype and the improvements have vastly slowed down.
So are calculators …
If you ignore the part where there proofs are meandering drivel, sure.
Even if you don't ignore this part they (e.g. o1-preview) are still better at proofs than the average human. Substantially better even.