| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by jahooma 590 days ago
	Ah yeah, that's what I mean! I thought RAG is synonymous with this vector search approach. Either way, we do the search step a little different and it works well.

2 comments

cratermoon 590 days ago

Any kind of search prior for content to provide as context to the LLM prompt is RAG. The goal is to leverage traditional information retrieval as a source of context. https://cloud.google.com/use-cases/retrieval-augmented-gener...

I'm currently working on a demonstration/POC system using my ElasticSearch as my content source, generating embeddings from that content, and passing them to my local LLM.

link

petesergeant 590 days ago

It would be cool to be talking to other people about the RAG systems they’re building. I’m working in a silo at the moment, and pretty sure that I’m reinventing a lot of techniques

link

petesergeant 590 days ago

I didn't mean to be down on it, and I'm really glad it's working well! If you start to reach the limits of what you can achieve with your current approach, there are lots of cute tricks you can steal from RAG, eg nothing stopping you doing a fuzzy keyword search for interesting-looking identifiers on larger codebases rather than giving the LLM the whole thing in-prompt, for example

link