Hacker News new | ask | show | jobs
Embedding Quantization: 25-45x retrieval speedup, 32x or 4x less memory usage (huggingface.co)
4 points by cubie 815 days ago