Hacker News
by krakaukiosk 3398 days ago
Tabula is a great tool. In my experience, it's the most reliable open-source software for extracting tables from PDFs. We use their underlying Tabula-Java library for some parts of https://docparser.com and happily sponsor their project.