Hacker News new | ask | show | jobs
by Onavo 584 days ago
What's the PDF parsing like?
1 comments

Extract all the text from the PDF, turn the pdf into images, send the text for each page along with the image to an LLM with a desired output strucutre.
You are not doing any of the fancy table extractor stuff?