|
|
|
|
|
by anentropic
1579 days ago
|
|
> I saw elastic has vector indexes now -- what does 'well' mean? Funny enough I was just reading about this today as a result of the OP article A commenter here, who apparently works for the pinecone.io vector db platform, states that: "Pre-filtering in most solutions (eg, Elasticsearch on AWS) requires using an inefficient brute-force search (kNN) through the remaining vectors after they've been filtered, because the original index was built on the unfiltered list and would no longer be useful. This causes sky-high search latencies." https://www.reddit.com/r/MachineLearning/comments/po3pos/com... |
|
In all honesty if you are already using ES and you just want nearest-neighbor search for less than 10M documents, just stay with ES.
Things get less obvious when you grow past 10M documents and still want low latency. Or if you need live index updates without downtime, or if you want to apply metadata filters to nearest-neighbor searches.
If you have 100M documents -- not a difficult threshold if you're an enterprise software company or a popular consumer app -- then ES gets ruled out fairly early in the process. We get a lot of those exasperated teams coming to Pinecone after trying their best with ES/OpenSearch.