Hacker News
by
SwellJoe
14 days ago
Everybody builds one. And then they usually figure out that making the model fill its context with a bunch of memories hurts performance more often than it helps.
esafak
14 days ago
That's why I always ask: got benchmarks?
zaydmulani
14 days ago
Yes: cargo run -p mnemo-bench. It ships with 12 benchmarks. The full retrieval pipeline takes ~4ms on a debug build. Numbers are in the README performance table.
SwellJoe
14 days ago
I don't care if it's fast if it makes the model dumber by cluttering up context.