Hacker News new | ask | show | jobs
by xeon06 490 days ago
Wow, I've been doing some PDF parsing at work and this is going to come in SO handy.
1 comments

Was mentioned in this thread, but I can also endorse qpdf as being a great library.

It gives you a JSON representation of the PDF data structure. What's nice is that doesn't hide the underlying format but it takes care of a lot of the low level edge cases for you.