by laen 1180 days ago
Can you elaborate on how you parse the PDF? Are you simply converting it to text with a Python library, or using something more robust like GROBID[1]?

1: https://github.com/kermitt2/grobid

2 comments

Do you know of anything that can process engineering drawings and diagrams by looking at how lines link text and other objects?
Not the OP, but that's what I do.