Hacker News new | ask | show | jobs
by haddr 796 days ago
Some years ago I compared those boilerplate removal tools and I remember that jusText was giving me the best results out of the box (tried readability and few other libraries too). I wonder what is the state of the art today?
1 comments

This is worth having a look at: https://mixmark-io.github.io/turndown/

With some configuration you can get most of the way there.