Hacker News new | ask | show | jobs
by blainm 453 days ago
So when it really struggled to get around (kept just walking into obstacles), they gave Claude the ability to navigate by adding pathfinding and awareness of its position and map ID. However, it still struggles, particularly in open-ended areas.

This suggests a fundamental issue beyond just navigation. While accessing more RAM data or using external tools using said data could improve consistency or get it further, that approach reduces the extent to which Claude is independently playing and reasoning.

A more effective solution would enhance its decision-making without relying on direct RAM access or any kind of fine tuning. I'm sure it's possible.

There has to be a better approach, and also in a way that's not relying on reading values from RAM or any kind of fine tuning.

2 comments

It can't do a good job of reasoning about higher-level abstractions in its long term memory without making poor decisions about which memory items to retain and which to forget.

Would a mixture-of-experts paradigm, where each expert weights the value of short-term memories differently to the weight of long-term memoried, do noticeably better at overcoming that one category of roadblocks?

Seems like the 200k context window is a huge issue and it's summarization deletes important information leading it to revisit solved areas even when it's working properly or simply forget things it needs to progress.