| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by pmarreck 1130 days ago
	There are tricks you can do to better utilize the smaller context window, such as sub-summaries and attention tricks. That's how there are already products on the market that consume entire big PDF's and let you query them. Granted, a larger context window would still work better, but it's possible to do.

2 comments

yawnxyz 1130 days ago

it's using "overlapping chunking" methods and it usually works for generic PDFs. It really falls apart on technical documents, SOPs and research articles where you need to get context from chunks way above. Using vector DBs also doesn't work well bc you have to twiddle around with window size / overlappy-ness, which changes depending on what kind of paper you're uploading. It's a mess and takes too long

link

marcopicentini 1129 days ago

The problem is that making a summary of a text of 100k token costs 2$ using Davinci.

link