Hacker News new | ask | show | jobs
by Kevin_Marks 5228 days ago
Relying on RSS feeds is tricky, as many of them are partial extracts, summaries, or just plain wrong (eg archival standalone pages linking to the current front page, stale feeds, links to now-defunct feed services).

If you want to help people writing these things, using hAtom in your HTML is a really good idea.

http://microformats.org/wiki/hatom

1 comments

Also HTML5 incorporates the article element/tag to help extract article contents: http://dev.w3.org/html5/spec/Overview.html#the-article-eleme...
The thing here is that when properly used, a page can contain several pieces of text tagged as articles, especially blogs with comments (think of article as "an article of clothing", not as "a magazine article"). You'd have to rely on other heuristics to find the "correct" article, which probably is not that much easier than finding the correct div element.
Take a look for example at Fred Wilson's blog http://www.avc.com he is using the article element. You can use multiple times in different blog posts in the same web page and I don't feel this is bad.