Hacker News new | ask | show | jobs
by SergeAx 1184 days ago
Pdf is a very unfortunate format. It is proprietary, it is paper-oriented, its almost single goal is to keep precise printing layout. But for the last 30 years world didn't come up with anything that could compete.
1 comments

PDF isn't the actual problem in this particular case. The documents here are photographs taken at different camera angles, embedded in PDFs.
I was going to say, using alt drag to select vertical columns is usually how I extract useable tables out from pdfs with embedded tables.