|
|
|
|
|
by nanidin
816 days ago
|
|
What you want to look into is retrieval augmented generation (RAG). Basically you store a bunch of facts as text in a vector database, then you query the vector DB for things to stuff into the prompt that you send to the LLM. What you write to the LLM might be "give me a conversation topic I might have with a member of the community", and your vector DB will have a bunch of things like "Dumbledore is a member of the community. Dumbledore likes to talk about magic". In theory the vector DB search will turn that up, and then it gets stuffed into the prompt so what actually goes to the LLM is something more like "You are a helpful AI assistant. Dumbledore is a member of the community. Dumbledore likes to talk about magic. give me a conversation topic I might have with a member of the community". The heavy lifting will then be collecting and storing all of the data that goes into the vector DB. |
|