Y
Hacker News
new
|
ask
|
show
|
jobs
by
syllogism
4290 days ago
My suggestion would be to understand the Boilerpipe algorithm, which as far as I can see is the best available solution (and much clearer than readability):
http://www.l3s.de/~kohlschuetter/publications/wsdm187-kohlsc...
You can then easily adapt it for your requirements.