|
|
|
|
|
by ryohkyo
1716 days ago
|
|
In concept each 16-digit chromosome would work as the nearest neighbors, and finding “a range” of the nearest neighbors would be very fast. In a data scientist’s perspective, this may seem to be a hack. I would be very appreciated to learn more from you about this. I can tell you that the weakness on this method is the multiple writes to the database. I assume vector database can gets this implemented with less writes. In the past, I have used this for supervised training and yielded very good results. However, I think this would be inefficient in large scale networks. I am planning to use Go + Clickhouse to improve the performance in the next project. |
|
You have 4 out of 6 open source DBs, and 2 commercial ones give you the managed service. All 6 can scale to quite large numbers of vectors.
My goal is to systematically study each DB through the lens of a specific search task, which will not be (only) text based.
If you think we could collaborate in some way on your dataset, that would be fantastic and probably a learning experience to both sides.