Hacker News new | ask | show | jobs
by TedPetrou 1297 days ago
The tabula-py Python library, a wrapper around tabula(https://tabula.technology/), can extract tables within PDFs and convert them to Pandas DataFrames. It's a great little library and this is a blog showcasing an example.