| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by Xenoamorphous 916 days ago
	How do RAG implementations usually get around the context size limitations in LLMs? Since it usually deals with PDFs and other docs that can be quite big, do they take only the first N tokens? Are abstractive summarisation techniques used?

1 comments

They split the document. Here’s an example of Markdown splitting. All this is far more an art than science at this point.