Hacker News
by Wiretrip 2296 days ago
Totally agree. I worked on a project that had to extract tables from PDFs. It is much harder than it first appears.

Detecting where tables are is still an active research area. Once we know where a table sits on the page, it's easier to parse out its structure.
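
To make the second step concrete, here is a rough sketch of what parsing the structure can look like once the table's bounding box is already known. Everything in it is an assumption for illustration: the word tuples, the `rows_in_region` name and the `y_tol` tolerance are hypothetical, and real PDFs (merged cells, wrapped text, rotated pages) need far more care than this.

```python
from typing import List, Tuple

# Hypothetical word record: (x0, top, x1, bottom, text), in page coordinates
# with the origin at the top-left. A PDF text extractor would supply these.
Word = Tuple[float, float, float, float, str]
BBox = Tuple[float, float, float, float]  # (x0, top, x1, bottom)


def rows_in_region(words: List[Word], bbox: BBox, y_tol: float = 3.0) -> List[List[str]]:
    """Group the words that fall inside bbox into rows, each read left to right."""
    bx0, btop, bx1, bbottom = bbox

    # Keep only words fully contained in the (already detected) table region.
    inside = [
        w for w in words
        if w[0] >= bx0 and w[2] <= bx1 and w[1] >= btop and w[3] <= bbottom
    ]
    inside.sort(key=lambda w: (w[1], w[0]))

    rows: List[List[Word]] = []
    row_tops: List[float] = []  # top coordinate of the first word in each row
    for w in inside:
        # A word joins the current row if its top is close to that row's top.
        if rows and abs(w[1] - row_tops[-1]) <= y_tol:
            rows[-1].append(w)
        else:
            rows.append([w])
            row_tops.append(w[1])

    return [[w[4] for w in sorted(row, key=lambda w: w[0])] for row in rows]


if __name__ == "__main__":
    sample = [
        (10, 100, 30, 110, "Name"),
        (60, 100, 80, 110, "Qty"),
        (10, 115, 35, 125, "Bolt"),
        (60, 116, 70, 126, "4"),
        (10, 300, 50, 310, "Footer"),  # outside the table region
    ]
    print(rows_in_region(sample, bbox=(0, 90, 100, 130)))
```

Note that the sketch only works because the bounding box is handed to it; finding that box in the first place is the part that remains hard.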