| HN Mirror


	by m3kw9 953 days ago
	How does this make vector DBs obsolete?

2 comments

dragonwriter 953 days ago

If you are using OpenAI, the new Assistants API looks like it will handle internally what you used to handle externally with a vector DB for RAG (and for some things, GPT-4-Turbo’s 128k context window will make it unnecessary entirely). There are other uses for vector DBs than RAG for LLMs, and there are reasons people might use non-OpenAI LLMs with RAG, so there is still a role for vector DBs, but it shrank a lot with this.

oezi 953 days ago

OpenAI is still way too expensive to run a corporate knowledge base on top of.

m3kw9 953 days ago

It’s more reliable than ChatPDF-style tools that rely on vector search. With a vector DB, all you are doing is a fuzzy search, then sending the portion of text near each match to an LLM as part of the prompt. It misses info.
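
To make the retrieve-then-prompt flow described above concrete, here is a minimal sketch. It is not how ChatPDF or any real vector DB works internally: `embed` is a hypothetical stand-in that uses word counts instead of a learned embedding model, and the chunks and question are made up for illustration.

```python
import math
import re
from collections import Counter


def embed(text):
    # Assumption: a bag-of-words count vector stands in for a real embedding model.
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    # Cosine similarity between two sparse count vectors.
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, chunks, k=1):
    # The "fuzzy search": rank chunks by similarity to the query, keep the top k.
    q = embed(query)
    return sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)[:k]


def build_prompt(query, chunks, k=1):
    # Only the retrieved chunks reach the LLM; everything else is dropped.
    context = "\n".join(retrieve(query, chunks, k))
    return f"Context:\n{context}\n\nQuestion: {query}"


# Hypothetical document chunks.
chunks = [
    "Refunds are processed within 14 days of the return request.",
    "Our office is closed on public holidays.",
    "Customers outside the EU pay for return shipping themselves.",
]
prompt = build_prompt("How long do refunds take?", chunks, k=1)
print(prompt)
```

With `k=1` only the first chunk makes it into the prompt. The shipping chunk, which a human might consider related, shares no words with the question and is left out, which is the "misses info" failure mode in miniature.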

dragonwriter 952 days ago

I'd be very surprised if the Assistants API is not doing RAG with a vector DB behind the scenes with the supplied files.

danielbln 953 days ago

It doesn't, but semantic search is a lot less relevant if you can squeeze 350 pages of text into the context.

quinncom 953 days ago

OpenAI charges for all those input tokens. An app that squeezes 350 pages of content into every request is going to cost more. Vector DBs are still relevant for cost and speed.
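
A rough back-of-the-envelope sketch of that cost argument follows. The 128k figure is the context size mentioned in the thread. The price per 1,000 tokens, the request count, and the 2,000-token retrieved context are all hypothetical placeholders, not actual OpenAI pricing.

```python
def input_cost(tokens_per_request, requests, price_per_1k_tokens):
    # Input-token cost only; output tokens are ignored in this sketch.
    return tokens_per_request / 1000 * price_per_1k_tokens * requests


PRICE_PER_1K = 0.01   # hypothetical price, not a real quote
REQUESTS = 1_000      # hypothetical request volume

# Stuff the whole 128k context window on every request.
full_context = input_cost(128_000, REQUESTS, PRICE_PER_1K)

# Retrieve a few relevant chunks instead (assumed 2,000 tokens per request).
retrieved = input_cost(2_000, REQUESTS, PRICE_PER_1K)

print(f"full context: {full_context:.2f}")
print(f"retrieved:    {retrieved:.2f}")
print(f"ratio:        {full_context / retrieved:.0f}x")
```

The ratio depends only on the token counts, so whatever the real price is, sending 128,000 tokens instead of 2,000 costs 64 times as much on the input side.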

gk1 953 days ago

Besides the cost factor, stuffing the context window can actually make the results worse. https://www.pinecone.io/blog/why-use-retrieval-instead-of-la...
