Hacker News new | ask | show | jobs
by OgsyedIE 453 days ago
It can't do a good job of reasoning about higher-level abstractions in its long term memory without making poor decisions about which memory items to retain and which to forget.

Would a mixture-of-experts paradigm, where each expert weights the value of short-term memories differently to the weight of long-term memoried, do noticeably better at overcoming that one category of roadblocks?