A Bitter Lesson for Memory

Y	Hacker News new \| ask \| show \| jobs

	A Bitter Lesson for Memory (personal-website-3bed.onrender.com)
	4 points by wenhan_zhou 2 days ago

2 comments

wgd 2 days ago

I've always been amazed at how terrible most frontier LLMs are at compaction given how embarrassingly easy it is to come up with half a dozen different RL training evals which would teach models to generate useful context summaries. Heck, you could bolt it onto any existing RL eval by just forcing a compaction every three turns.

link

wenhan_zhou 2 days ago

Yep. Or even better, compact after a random number of turns. The model must then learn to preserve useful context at arbitrary context lengths.

link

wenhan_zhou 2 days ago

If understanding emerges from pre-training, then perhaps memory is what emerges from post-training.

link