by lyu07282 276 days ago
Thanks for doing the math! I suppose, if we're being charitable, in practice we would of course build an index and only partially offload it to VRAM (FAISS does that with IVF/PQ and similar).
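For a rough sense of why an IVF/PQ-style index changes the picture, here's a back-of-envelope sketch. Everything in it is an assumption for illustration: the function names, the vector count, the dimension, and the PQ settings are all made up, not taken from the parent's math, and real FAISS memory use has extra overhead this ignores.

```python
def flat_bytes(n, d, bytes_per_float=4):
    """Memory for n raw float32 vectors of dimension d (no compression)."""
    return n * d * bytes_per_float


def ivfpq_bytes(n, d, m, nbits=8, nlist=4096, id_bytes=8):
    """Rough memory estimate for an IVF+PQ-style index (hypothetical model).

    m      : number of PQ sub-quantizers (each vector becomes m codes)
    nbits  : bits per code
    nlist  : number of coarse (IVF) centroids
    """
    code_size = (m * nbits + 7) // 8      # bytes per compressed vector
    codes = n * code_size                 # PQ codes for all vectors
    ids = n * id_bytes                    # one stored id per vector
    centroids = nlist * d * 4             # float32 coarse centroids
    codebooks = (2 ** nbits) * d * 4      # m codebooks of 2^nbits x (d/m) floats
    return codes + ids + centroids + codebooks


if __name__ == "__main__":
    # Hypothetical workload: 100M vectors, 768 dims, 64 one-byte codes each.
    n, d, m = 100_000_000, 768, 64
    print(f"flat:   {flat_bytes(n, d) / 1e9:.1f} GB")
    print(f"ivf+pq: {ivfpq_bytes(n, d, m) / 1e9:.1f} GB")
```

The point is just that the compressed codes are orders of magnitude smaller than the raw vectors, so the part that needs to sit in VRAM can fit even when the full float32 data can't.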