Hacker News new | ask | show | jobs
by TeMPOraL 369 days ago
> The Hanoi Towers example demonstrates that SOTA RLMs struggle with tasks a pre-schooler solves.

Find me one that can solve it entirely in their head without touching the actual thing and externalizing state.