Hacker News new | ask | show | jobs
by matthewshere 441 days ago
You're absolutely right, PDFs can be incredibly tricky. That lack of a consistent, easily parsable structure for arbitrary data is the core challenge.