Hacker News new | ask | show | jobs
by unmei 4654 days ago
Very nice. I've been doing some table extraction from PDFs recently. Also check out PDF2JSON for nodejs-based parsing - it grabs all the texts and positions so you don't have to 'intercept' draw calls and dumps them out in JSON.
1 comments

Thanks. I looked into that recently, it does make this a lot easier, so now I have a node version of this as well.