Hacker News new | ask | show | jobs
by dvfjsdhgfv 369 days ago
The Hanoi Towers example demonstrates that SOTA RLMs struggle with tasks a pre-schooler solves.

The implication here is that they excel at things that occur very often and are bad at novelty. This is good for individuals (by using RLMs I can quickly learn about many other aspects of human body of knowledge in a way impossible/inefficient with traditional methods) but they are bad at innovation. Which, honestly, is not necessarily bad: we can offload lower-level tasks[0] to RLMs and pursue innovation as humans.

[0] Usual caveats apply: with time, the population of people actually good at these low-level tasks will diminish, just as we have very few Assembler programmers for Intel/AMD processors.

1 comments

> The Hanoi Towers example demonstrates that SOTA RLMs struggle with tasks a pre-schooler solves.

Find me one that can solve it entirely in their head without touching the actual thing and externalizing state.