|
|
|
|
|
by discordance
844 days ago
|
|
RAG: 1. First you create embeddings from your documents 2. Store that in a vector db 3. Ask what the user wants and do a search in the vector db (cosine similarity etc) 4. Feed the relevant search results to your LLM and do the usual LLM stuff with the returned embeddings and chunks of the documents |
|
Would you define RAG only as 'prompt optimisation that involves embeddings'?