Hacker News new | ask | show | jobs
by iandanforth 1225 days ago
Can you say more? Usually projects that gravitate to SQLlite are not those that require massive scale and a FAISS index of a few GB covers a lot of documents.
2 comments

My dataset is going to be around 10M documents. With OpenAI embeddings, that will be around 62GB. AFAIK SQLite should be able to handle that size, but I haven't tried.

This is not going to be my primary DB. I would update this maybe once a day and the update doesn't have to be super fast.

you might check out some vector databases:

https://milvus.io/

AND

pinecone.io

there are others too