Hacker News new | ask | show | jobs
by m1sta_ 1128 days ago
Cam you help me understand this? The research appears to be from a few years ago. Can this be used with Claude (for example)? How is it different to the approach many people are taking with vector stores and embeddings?
2 comments

Other people seem to be suggesting that the user would do the retrieval of the relevant parts of the book from a vectordb first, and then feed those sections along with the question as the prompt. Conceptually it is very similar (and it too uses vector database), but with RAG it would happen as part of the inferencing pipeline and therefore achieve better performance than the end user emulating it.
Yep, but your retrieval from the vector DB becomes your relevancy bottleneck.
it's not different. RAG is a way to train embedding stores end to end
somehow got down voted on something I'm a professional expert at