| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by TruthWillHurt 860 days ago
	I think RAG approach with Vector DB is more likely. Just like when you add a file to your prompt / custom GPTs. Adding the entire file (or memory in this case) would take up too much of the context. So just query the DB and if there's a match add it to the prompt after the conversation started.

1 comments

lxgr 860 days ago

These "memories" seem rather short, much shorter than the average document in a knowledge base or FAQ, for example. Maybe they do get compressed to embedding vectors, though.

I could imagine that once there's too many, it would indeed make sense to classify them as a database, though: "Prefers cats over dogs" is probably not salient information in too many queries.

link