Hacker News new | ask | show | jobs
by bootsmann 1067 days ago
You serve the embedding model in a lambda and then run something like FAISS in the backend.