| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by SwellJoe 14 days ago
	Everybody builds one. And, then they usually figure out that making the model fill its context with a bunch of memories hurts performance more often than it helps.

1 comments

esafak 14 days ago

That's why I always ask: got benchmarks?

link

zaydmulani 14 days ago

Yes — cargo run -p mnemo-bench. Ships with 12 benchmarks. Full retrieval pipeline is ~4ms on debug build. Numbers are in the README performance table.

link

SwellJoe 14 days ago

I don't care if it's fast, if it makes the model dumber by cluttering up context.

link