|
|
|
|
|
by alach11
1040 days ago
|
|
The simplest way, as rdedev is describing, is to do Retrieval Augmented Generation (RAG) in your prompting. This would require the addition of a vector database and a text embedding model. There are many open source / local / private options for that. The steps would then be:
1. Embed your private data in chunks and store the resulting embeddings in a vector database
2. In your prompting workflow, when a user queries the chat model, embed their query using the embedding model
3. Retrieve the most similar chunks of text from your vector database based on cosine similarity
4. In the chat response, provide it the context of those chunks of text For example, if you asked "who have I discussed Ubuntu with?", it might retrieve emails that have similar content. Then the model will be able to answer informed by that context. |
|