|
|
|
Ask HN: Bad Results from Vector Search?
|
|
2 points
by tuckerconnelly
988 days ago
|
|
Hi, I'm building an RAG system with OpenAI embeddings and the ChatGPT api. I chunked all the documents into 400-800 character chunks, vectorized them all, and put them in a vector database. The results are pretty bad--the surfaced document chunks kind-of-but-not-really match up with the query. I'm getting much better results from simple keyword searches (using meilisearch). Am I doing something wrong? Do I need to use a fine-tuned model like BERT? Is this technology vastly overhyped? |
|
https://sbert.net/
My take is you might like keyword searches better for some queries and you might like embedding search for others.
The problems of: (1) How to combine keyword search and embedding search (you'd imagine you'd want a ranking function that handles both) and (2) How to handle chunks are both hard.
As for (2) you probably want to make the chunks as big as you practically can, you should be chunking on tokens instead of characters if you at all can.
With the chunks of course you don't get a score for the query-document relationship you get the query-chunk score instead which isn't quite the score you really want, aggregating all the chunk hits and properly chunking the data is an open problem to say the least.