by aaln 586 days ago
Extract all the text from the PDF, turn each page of the PDF into an image, then send each page's text along with its image to an LLM with the desired output structure.
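Roughly, the per-page bundling could look like the sketch below. It assumes the text and the page images have already been pulled out of the PDF by whatever library you prefer; `Page`, `build_page_request`, `build_requests`, and the payload layout are all made-up names for illustration, not any particular LLM vendor's API.

```python
import base64
import json
from dataclasses import dataclass


@dataclass
class Page:
    number: int       # 1-based page number
    text: str         # text already extracted from this PDF page
    image_png: bytes  # the same page rendered as PNG bytes


def build_page_request(page: Page, output_schema: dict) -> dict:
    """Bundle one page's text and image into a single request payload.

    The payload layout is hypothetical; adapt it to the LLM API you use.
    """
    prompt = (
        "Extract the data on this page and reply with JSON matching this schema:\n"
        f"{json.dumps(output_schema, indent=2)}\n\n"
        f"Page {page.number} text:\n{page.text}"
    )
    return {
        "page": page.number,
        "prompt": prompt,
        # Images are base64-encoded so the payload stays JSON-serializable.
        "image_base64": base64.b64encode(page.image_png).decode("ascii"),
    }


def build_requests(pages: list[Page], output_schema: dict) -> list[dict]:
    """Build one request per page, preserving page order."""
    return [build_page_request(p, output_schema) for p in pages]
```

Sending the text and the image together lets the model cross-check the two: the text gives exact characters, the image gives layout. Each payload would then go to the model as one call per page.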
1 comment

You are not doing any of the fancy table extractor stuff?