Hacker News new | ask | show | jobs
by amitness 545 days ago
Try this trick that I learned from Cohere: - Fetch top 10*k (i.e. 100) results using the hamming distance - Rerank by taking dot product between query embedding (full precision) and binary doc embeddings - Show top-10 results after re-ranking
1 comments

This is pretty cool. The dot product would give the unnormalized cosine similarity from a smaller pool. Thank you so much!