Hacker News new | ask | show | jobs
by cmcollier 891 days ago
As a general rule (for now), you'll get the best search result for your dev time, with straight ahead BM25 via a js lib.

In terms of overhead, with lower doc counts there's not much overhead with embeddings and knn/ann. Imagine 384 floats per doc or whatever embedding size. At scale it becomes more problematic and less comparable.

With all that said, messing around with vector ops and WASM sounds more fun :)

1 comments

Thanks for the feedback!