Hacker News new | ask | show | jobs
by dleeftink 486 days ago
An oldy but goody for layout extraction is Cermine by Dominika Tkaczyk and colleagues[0]. Java required.

[0]: http://cermine.ceon.pl/about.html

2 comments

didnt know this!