|
|
|
|
|
by nickwatson
410 days ago
|
|
Good question — it's pretty straightforward right now: I pass the collected content chunks (with their original URLs attached) into Gemini 2.5 Pro, asking it to synthesize a balanced report and to inline citations throughout. So it's not doing anything fancy like dynamic retrieval or classic RAG architecture. Basically:
- The agent gathers sources (webpages, PDFs, Reddit, etc.)
- Summarises each as it goes (using a cheaper model)
- Then hands a bundle of summarised + raw content to Gemini 2.5 Pro
- Gemini 2.5 Pro writes the final report, embedding links directly as citations with [1], [2], etc style citations throughout. Reverse-RAG is something I for sure want to implement. Once I can afford a better computer to run this with at scale. Even an 8B model will take overnight to summarize an average piece of content for me right now! But I'm also keeping an eye on the pace of which AI moves in the larger LLM space. The size and abilities of likes of Gemini 2.5 Pro context windows are pretty crazy these days! Thanks for the question. |
|