by jonbaer 1723 days ago
I feel like I have been talking about LSH for years

I recall Uber using it in a white paper writeup.

Is the speedup that remarkable? I'd be curious as to the increase in speed versus the loss of precision.

I view ANN search as the clear winner among retrieval methods for domains with a very large number of items (on the order of 10 million or more) and a high volume of searches. I've worked at a couple of different major tech companies, and I'd consider two-tower models plus ANN a classic pair. One tower computes the request embedding and is called on every request; the other computes the item embeddings. You compute all item embeddings periodically and build an ANN index over them. Finding the items with the top dot product against the request, under minor constraints, then only takes running the request tower and searching the index.
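
For anyone who wants to see the shape of that pipeline, here is a minimal sketch. It assumes random-hyperplane LSH as the ANN index (the topic of the thread; a production system may well use a different index type) and plain lists of floats standing in for the outputs of the two towers. Every function name here is made up for illustration.

```python
import random

def make_planes(dim, n_bits, seed=0):
    # Random hyperplanes through the origin, one per signature bit.
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(n_bits)]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def lsh_code(vec, planes):
    # One bit per hyperplane: which side of the plane the vector falls on.
    return tuple(dot(vec, p) > 0 for p in planes)

def build_index(item_embs, planes):
    # Offline step: run periodically over all item-tower embeddings.
    buckets = {}
    for item_id, emb in enumerate(item_embs):
        buckets.setdefault(lsh_code(emb, planes), []).append(item_id)
    return buckets

def search(request_emb, item_embs, planes, buckets, k=10):
    # Online step: hash the request embedding, then score only the
    # items in the matching bucket instead of the whole catalogue.
    candidates = buckets.get(lsh_code(request_emb, planes), [])
    ranked = sorted(candidates,
                    key=lambda i: dot(request_emb, item_embs[i]),
                    reverse=True)
    return ranked[:k]
```

This is also where the speed versus precision trade-off shows up: more bits per signature means smaller buckets and faster queries, but a higher chance that a true top item lands in a different bucket. Practical LSH setups usually probe several hash tables or neighbouring buckets to recover recall.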

The speedup is really a necessity, as direct methods are just too expensive both dollar-wise and time-wise (lowish latency is the goal).

Yes. Storage also. You can get >99% of the ordering quality of exhaustive cosine search with a tiny fraction of the memory usage.
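
To put rough numbers on the storage point, here is a back-of-the-envelope calculation. The catalogue size, embedding dimension and signature length are all assumed for illustration, and it only compares raw float32 vectors against fixed-length bit signatures. It says nothing about what signature length is needed to reach the >99% figure.

```python
def float32_bytes(n_items, dim):
    # 4 bytes per float32 component.
    return n_items * dim * 4

def signature_bytes(n_items, n_bits):
    # Bit signatures packed 8 bits to a byte.
    return n_items * n_bits // 8

# Assumed sizes, not taken from the thread.
n_items, dim, n_bits = 10_000_000, 128, 256

full = float32_bytes(n_items, dim)
compact = signature_bytes(n_items, n_bits)
print(f"float32 vectors: {full / 1e9:.2f} GB")
print(f"bit signatures:  {compact / 1e9:.2f} GB")
print(f"ratio:           {full / compact:.0f}x")
```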