Hacker News new | ask | show | jobs
by TruthWillHurt 860 days ago
I think RAG approach with Vector DB is more likely. Just like when you add a file to your prompt / custom GPTs.

Adding the entire file (or memory in this case) would take up too much of the context. So just query the DB and if there's a match add it to the prompt after the conversation started.

1 comments

These "memories" seem rather short, much shorter than the average document in a knowledge base or FAQ, for example. Maybe they do get compressed to embedding vectors, though.

I could imagine that once there's too many, it would indeed make sense to classify them as a database, though: "Prefers cats over dogs" is probably not salient information in too many queries.