Hacker News new | ask | show | jobs
by cj 647 days ago
Related article from 4 days ago (with comments on scraping, specifically discussing removing HTML tags)

https://news.ycombinator.com/item?id=41428274

Edit: looks like it's actually the same author