| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by cubefox 636 days ago
	They are already above average human level on many tasks, like math benchmarks.

4 comments

amusedcyclist 633 days ago

They really aren't better than humans at math or logic, they are good at the benchmarks because they are hyper optimized for the benchmarks lol. But if you ask LLMs simple logical questions they still get them wrong all the time

link

lawn 636 days ago

Yes, there are certain tasks they're great at, just as AI has been superhuman in some tasks for decades.

link

cubefox 635 days ago

But now they are good or even great at way more tasks than before because they can understand and use natural languages like English.

link

lawn 635 days ago

Yeah, and they're still under delivering to their hype and the improvements have vastly slowed down.

So are calculators …

If you ignore the part where there proofs are meandering drivel, sure.

link

cubefox 636 days ago

Even if you don't ignore this part they (e.g. o1-preview) are still better at proofs than the average human. Substantially better even.

link