| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by rvz 476 days ago

> TL:DR: So, how's it doing? Well, pretty badly. Worse than a 6-year-old would, definitely not PhD-level.

This is what happens when you try to apply hype technology (LLMs) on to every problem, especially with a company that has amassed too much hype too quickly.

The limits of said technology tell us that Claude has a very limited memory to plan in the game which is why it is obviously struggling. But expanding those limitations would cost Anthropic an enormous amount of money and compute even if they did that.

So you can clearly see that if LLM are unable to beat this game in an efficient manner to test for planning and reasoning, what hope is there for it with much challenging and complex scenarios which is required for so-called "AGI"?

The most important sentence in this article is this:

>> ...some new paradigm is yet required for them to be right.

1 comments

YeGoblynQueenne 476 days ago

Yeah, hope dies last.

>> ClaudePlaysPokémon is proof that the last 6 months of AI innovation, while incredible, are still far from the true unhobbling necessary for an AI revolution. That doesn't mean 2-year AGI timelines are wrong, but it does feel to me like some new paradigm is yet required for them to be right.

The predictions aren't wrong! They're just nowhere near being right yet!

link