Hacker News new | ask | show | jobs
by saddino 4822 days ago
Wow, nice work! I'm the author Trapeze, a once-shareware (now freeware and open source) PDF-to-Word/RTF/HTML/PlainText application for OS X. My approach was similar: trying to squash characters into words via a logical grid to determine whitespace. My #1 request from customers was to extract tables and I never had the guts to attempt it. :-)

(For those interested, you can grab Trapeze from mesadynamics.com -- requires OS X 10.4; source code is a mixture of C++ and Objective-C).