Hacker News new | ask | show | jobs
by syllogism 4290 days ago
My suggestion would be to understand the Boilerpipe algorithm, which as far as I can see is the best available solution (and much clearer than readability): http://www.l3s.de/~kohlschuetter/publications/wsdm187-kohlsc...

You can then easily adapt it for your requirements.