Y
Hacker News
new
|
ask
|
show
|
jobs
Embedding Quantization: 25-45x retrieval speedup, 32x or 4x less memory usage
(
huggingface.co
)
4 points
by
cubie
815 days ago