Show HN: MinDB – an extremely memory-efficient vector database

Y	Hacker News new \| ask \| show \| jobs

	Show HN: MinDB – an extremely memory-efficient vector database (github.com)
	17 points by zmccormick7 679 days ago

3 comments

throwaway888abc 678 days ago

That's looks exciting. Do you guys have more detailed benchmarks (doesn't have to be polished article), pastebin welcome ?

Thank you, will keep an eye

link

zmccormick7 678 days ago

We've only done full benchmarking with the FIQA dataset, comparing minDB with Chroma. We're going to try it with Qdrant and Weaviate soon too, since they both have support for quantization, which will be a more apples-to-apples comparison with our approach.

We did test uploading and querying a Wikipedia dump, which was ~35M vectors. Query latency was around 150ms and peak memory usage was 1.5GB. We couldn't test recall, though, because we didn't have queries with ground truths.

link

pablomendes 678 days ago

Cool! What's next in the roadmap?

link

zmccormick7 676 days ago

The main thing we need to add is metadata filtering, as that's required for a lot of use cases. We're also thinking about adding hybrid search support and multi-factor ranking.

link

caeser 673 days ago

extremely efficient: python.

link