by andrewio 1011 days ago
To extract tables from PDFs, you can use the following tools:

1. Tabula (https://tabula.technology): a free and open-source tool.

2. Parsio (https://parsio.io): uses pre-trained AI models for data extraction from PDFs, emails, and other formats.

3. Airparser (https://airparser.com): uses a GPT-based approach, similar to ChatGPT, to extract data from PDFs, emails, and other formats.
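
For the Tabula option, here is a rough sketch of scripting it instead of using the GUI. It assumes you have Java installed and have downloaded the tabula-java command-line jar; the `tabula.jar` path and the file names are placeholders, not anything Tabula ships under those names. The helper names are made up for illustration. It only uses the `-p` (pages), `-f` (format), `-o` (output file), and `-l` (lattice mode) options of the tabula-java CLI.

```python
import subprocess
from pathlib import Path


def build_tabula_command(pdf_path, out_path, jar_path="tabula.jar",
                         pages="all", lattice=False):
    """Build a tabula-java command line that writes extracted tables as CSV.

    jar_path is a hypothetical location for a downloaded tabula-java jar.
    """
    cmd = [
        "java", "-jar", str(jar_path),
        "-p", str(pages),    # pages to scan, e.g. "all" or "1-3"
        "-f", "CSV",         # output format
        "-o", str(out_path), # output file
    ]
    if lattice:
        # Lattice mode suits tables drawn with visible ruling lines.
        cmd.append("-l")
    cmd.append(str(pdf_path))
    return cmd


def extract_tables(pdf_path, out_path, **options):
    """Run tabula-java; raises CalledProcessError if the tool exits non-zero."""
    subprocess.run(build_tabula_command(pdf_path, out_path, **options), check=True)
    return Path(out_path)
```

Calling `extract_tables("report.pdf", "tables.csv")` should leave the extracted rows in `tables.csv`, assuming the PDF has a text layer. Scanned PDFs would need OCR first, which this sketch does not handle.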