| HN Mirror



	by m1sta_ 1175 days ago
	Can you help me understand this? The research appears to be from a few years ago. Can this be used with Claude (for example)? How is it different to the approach many people are taking with vector stores and embeddings?

2 comments

pyth0 1175 days ago

Other people seem to be suggesting that the user would first retrieve the relevant parts of the book from a vector DB, and then feed those sections along with the question as the prompt. Conceptually it is very similar (and it too uses a vector database), but with RAG the retrieval happens as part of the inference pipeline and therefore achieves better performance than the end user emulating it.
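
For what it's worth, the "retrieve first, then prompt" approach can be sketched roughly like this. It is only a toy illustration: `embed` is a hypothetical bag-of-words stand-in for a real embedding model, the in-memory list stands in for a vector DB, and the prompt layout is my own assumption rather than anything a particular model requires.

```python
import math
from collections import Counter


def embed(text):
    """Hypothetical stand-in for an embedding model: bag-of-words counts."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(question, passages, k=2):
    """Return the k passages most similar to the question."""
    q = embed(question)
    ranked = sorted(passages, key=lambda p: cosine(q, embed(p)), reverse=True)
    return ranked[:k]


def build_prompt(question, passages, k=2):
    """Paste the retrieved passages in front of the question."""
    context = "\n".join(retrieve(question, passages, k))
    return f"Context:\n{context}\n\nQuestion: {question}"


# Made-up passages standing in for chunks of a book.
passages = [
    "The whale was first sighted near the equator.",
    "Ahab lost his leg to the white whale.",
    "The ship carried barrels of oil.",
]
prompt = build_prompt("How did Ahab lose his leg?", passages, k=1)
print(prompt)
```

The model only ever sees whatever `retrieve` returns, so if the similarity ranking misses the right passage, the answer cannot use it.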


ukuina 1174 days ago

Yep, but your retrieval from the vector DB becomes your relevancy bottleneck.


make3 1175 days ago

it's not different. RAG is a way to train embedding stores end to end


make3 1175 days ago

somehow got downvoted on something I'm a professional expert at
