Hacker News new | ask | show | jobs
by chip 5863 days ago
Ah, that is the result of the Readability, http://lab.arc90.com/experiments/readability/ algorithm, it is a bit greedy and chopped off some of the content.

If there are better algorithms/tools for scraping content please let me know.