| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by dchuk 1230 days ago
	When using SBERT instead of gpt for this use case, is it paired with some sort of vector database or just all done in code/memory?

1 comments

leobg 1230 days ago

You’d want persistence, since the embedding process takes some time. But you don’t need to go all Pinecone on this. There is FAISS, and there is hnswlib, for example. Like SQLite for vector search.

link

gk1 1230 days ago

Friendly reminder that we (Pinecone) have a free tier that holds up to ~5M SBERT embeddings (x768 dimensions). For quick projects, going "all Pinecone on this" could turn out to be the easier and faster option.

link

leobg 1230 days ago

Point taken ;-)

I like to stand up for the little guy. I hear Pinecone this and Pinecone that. And nobody seems to pay any attention to the awesome dude who made hnswlib.

link

gk1 1230 days ago

Who, Yury Malkov? He won’t be offended… He’s an advisor to Pinecone. :)

And yes, both he and HNSW are awesome.

link

moneywoes 1229 days ago

What about Postgres with pg vector?

link