Hacker News new | ask | show | jobs
by hermitcrab 1930 days ago
Tangentially, I would like to be able to extract tables from PDF files for my Easy Data Transform software. So I would like to find a C++ library that does this. Can anyone recommend one? Needs to work with proprietary software (so no GPL). Doesn't have to be free. And what is the state of art on this? How reliably can data tables be extract from real world PDFs?