Hacker News
Physical and Logical Structure Recognition of PDF Documents (bloechle.ch)
4 points by NWoodsman 1305 days ago
1 comment

An interesting thesis from 2010 that uses machine learning to reconstruct PDF documents, rebuilding each PDF with an object model.

Can anyone shed light on the current state of the art?