by nanidin 816 days ago
What you want to look into is retrieval-augmented generation (RAG). Basically you store a bunch of facts as text in a vector database, then at query time you search the vector DB for the facts most relevant to the question and stuff them into the prompt that you send to the LLM.

What you write to the LLM might be "give me a conversation topic I might have with a member of the community", and your vector DB will have a bunch of things like "Dumbledore is a member of the community. Dumbledore likes to talk about magic". In theory the vector DB search will turn that up, and then it gets stuffed into the prompt so what actually goes to the LLM is something more like "You are a helpful AI assistant. Dumbledore is a member of the community. Dumbledore likes to talk about magic. give me a conversation topic I might have with a member of the community".
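To make that flow concrete, here is a rough sketch of it in plain Python. Everything in it is an assumption for illustration: `embed`, `ToyVectorStore`, and `build_prompt` are made-up names, the "embedding" is just a bag-of-words count standing in for a real embedding model, and the bakery and recycling facts are invented filler. A real setup would use an actual embedding model and vector database, and would send the resulting prompt to an LLM.

```python
import math
import re
from collections import Counter


def embed(text):
    """Hypothetical stand-in for an embedding model: a bag-of-words count vector."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[w] * b[w] for w in a if w in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class ToyVectorStore:
    """In-memory stand-in for a vector DB (illustrative only)."""

    def __init__(self):
        self.rows = []  # list of (vector, original text)

    def add(self, fact):
        self.rows.append((embed(fact), fact))

    def search(self, query, k=1):
        # Rank stored facts by similarity to the query and keep the top k.
        q = embed(query)
        ranked = sorted(self.rows, key=lambda row: cosine(q, row[0]), reverse=True)
        return [fact for _, fact in ranked[:k]]


def build_prompt(store, question, k=1):
    # Retrieve relevant facts, then stuff them between the system text and the question.
    facts = store.search(question, k)
    return " ".join(["You are a helpful AI assistant.", *facts, question])


store = ToyVectorStore()
store.add("Dumbledore is a member of the community. Dumbledore likes to talk about magic.")
store.add("The bakery on Main Street opens at 7am.")       # invented filler fact
store.add("Recycling is collected every other Tuesday.")    # invented filler fact

question = "give me a conversation topic I might have with a member of the community"
prompt = build_prompt(store, question)
print(prompt)  # this string is what would actually be sent to the LLM
```

Word-overlap matching like this is much cruder than a real embedding search, which is part of why retrieval only works "in theory": the quality of what gets stuffed into the prompt depends on how well the search matches the question to the stored facts.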

The heavy lifting will then be collecting and storing all of the data that goes into the vector DB.

1 comment

Thanks a lot. Do you happen to know of any products that make it easy to implement this without being too technical?
The various LangChain libraries out there make it easy for developers, and they include examples.