Hacker News new | ask | show | jobs
by alexcg1 1430 days ago
You mean the PDFSegmenter Executor in the notebook?
1 comments

Yes
PDFSegmenter also extracts images, which can then be OCR'ed in the next step of the pipeline