Text-Only NPR (text.npr.org)
27 points by imwally 584 days ago
2 comments

This reminds me of the CNN Lite website -- https://lite.cnn.com/
I prefer both of them to the ordinary versions. So much less noise, and a much slower pace. Only 80% junk, which is super low by the standards of modern news websites.
Perfect for LLMs!

We use an HTML->Markdown converter for web scraping, and sites like this make it even easier and more robust.
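
To illustrate why plain pages like this are easy to convert, here is a minimal sketch of an HTML->Markdown pass using only Python's standard library. It is not the converter we actually run; the class name `MarkdownSketch` and the function `html_to_markdown` are made up for this example, and it only handles headings, paragraphs, links, and list items.

```python
import re
from html.parser import HTMLParser


class MarkdownSketch(HTMLParser):
    """Hypothetical, deliberately tiny HTML -> Markdown converter."""

    def __init__(self):
        super().__init__()
        self.out = []      # Markdown fragments, joined at the end
        self.href = None   # href of the <a> currently open, if any

    def handle_starttag(self, tag, attrs):
        if tag in ("h1", "h2", "h3", "h4", "h5", "h6"):
            # <h2> becomes "## ", and so on
            self.out.append("\n\n" + "#" * int(tag[1]) + " ")
        elif tag in ("p", "ul", "ol"):
            self.out.append("\n\n")
        elif tag == "li":
            # Simplification: ordered lists are also emitted as bullets
            self.out.append("\n- ")
        elif tag == "a":
            self.href = dict(attrs).get("href")
            self.out.append("[")

    def handle_endtag(self, tag):
        if tag == "a":
            self.out.append("](%s)" % (self.href or ""))
            self.href = None

    def handle_data(self, data):
        # Collapse whitespace runs; source indentation carries no meaning
        self.out.append(re.sub(r"\s+", " ", data))


def html_to_markdown(html):
    parser = MarkdownSketch()
    parser.feed(html)
    parser.close()
    text = "".join(parser.out)
    text = re.sub(r" *\n *", "\n", text)    # drop spaces hugging line breaks
    text = re.sub(r"\n{3,}", "\n\n", text)  # at most one blank line in a row
    return text.strip()
```

On a text-only page there is little besides these tags, which is the point. A real scraper would also need to skip `<script>` and `<style>` contents and handle many more elements, so a maintained converter is the better choice in practice.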

Side note: bring back the RSS feeds?

Curious, can you recommend an HTML -> MD converter?
pandoc is pretty good https://pandoc.org/demos.html
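
A rough sketch of calling it from a Python scraper, assuming the `pandoc` binary is installed and on your PATH. The helper names `pandoc_args` and `convert` are made up for this example; `-f` and `-t` are pandoc's options for the input and output formats.

```python
import shutil
import subprocess


def pandoc_args(src="html", dst="markdown"):
    """Build the pandoc command line: -f is the input format, -t the output format."""
    return ["pandoc", "-f", src, "-t", dst]


def convert(text, src="html", dst="markdown"):
    """Pipe text through pandoc on stdin and return what it writes to stdout."""
    if shutil.which("pandoc") is None:
        raise RuntimeError("pandoc is not installed or not on PATH")
    result = subprocess.run(
        pandoc_args(src, dst),
        input=text,
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if pandoc exits non-zero
    )
    return result.stdout
```

Something like `convert(page_html)` would then give you Markdown, and swapping the two format arguments runs the conversion the other way.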
D'oh, thanks. I use Pandoc plenty for MD -> HTML; I totally spaced that it can do the inverse. I was briefly looking at [Turndown](https://github.com/mixmark-io/turndown) for projects in the JS ecosystem.
Why not just use the HTML...?