Hacker News
by jacquesm 1931 days ago
This is really neat. A lot of the hard work in converting scientific PDFs to text is dealing with the tables, which more often than not are graphical and usually have no text overlay.