Hacker News new | ask | show | jobs
by gk1 1054 days ago
Hey, I'm from Pinecone. What scale are we talking about? Many of our customers come to us with 500M–10B embeddings precisely because other managed solutions either ground to a halt at that scale or cost even more.

Even so, driving the cost down for large workloads like that is a priority for us. We recognize the GenAI / RAG stack is a completely new line item in most companies' budgets so anything to keep that low can help these projects move forward.