|
|
|
|
|
by vannevar
694 days ago
|
|
>throw out vector DB and embeddings into the trashcan (they are pulling junk information into the context and causing hallucinations) Not sure why this would be true. In my experience, semantic search based on a vector index/embeddings pulls in more relevant information than a full-text keyword search. Maybe there is too broad a set of materials in your vector db, or the chunking strategy isn't good? |
|
My problem with similarity search - it is unpredictable. It can sometimes miss really obvious matches or pull completely irrelevant snippets. When this happens - this causes downstream hallucinations that are hard to fix.
My customers don’t tolerate hallucinations.
Query expansion with FTS search works more predictably for me. Especially, if we factor in search scope reduction driven by the request classifier (“agent router”)