by KronisLV 1018 days ago
I wonder where you're getting all of the blogs from, whether it's just user suggestions or lots of personal favorites. Sounds like a bit of work but I'm glad that people are trying to do that sort of thing and helping others discover more articles!

I built the Hacker News Personal Blogs site a while ago, after people shared their blogs in an Ask HN post and someone then compiled those that had Atom/RSS feeds into an OPML file on GitHub: https://hn-blogs.kronis.dev/ The List Of Blogs page might be useful for discovering a few more blogs to add, if anything catches your eye. That site of mine is pretty basic and I just threw it together in a day or two because it felt possible, but I hope that your site actually goes somewhere in comparison!

Also, embedding the blogs inside of iframes is pretty cool. I wonder how common it is for sites to prevent that with the X-Frame-Options header or the CSP frame-ancestors directive. It was actually interesting to see posts from Substack and other sites being captured as screenshots instead, as a workaround.
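For anyone curious how that could be measured, here is a rough sketch of a header check. It assumes you already have the response headers as a dict; the function name `framing_restricted` is made up for illustration, and real browsers apply more detailed rules (multiple headers, source matching), so treat it as an approximation:

```python
def framing_restricted(headers):
    """Return True if the headers appear to restrict third-party iframe embedding."""
    # Header names are case-insensitive, so normalize them first.
    normalized = {name.lower(): value for name, value in headers.items()}

    # X-Frame-Options: DENY and SAMEORIGIN both block embedding on other sites.
    xfo = normalized.get("x-frame-options", "").strip().lower()
    if xfo in ("deny", "sameorigin"):
        return True

    # Content-Security-Policy: look for a frame-ancestors directive.
    csp = normalized.get("content-security-policy", "")
    for directive in csp.split(";"):
        parts = directive.strip().split()
        if parts and parts[0].lower() == "frame-ancestors":
            # Anything other than a bare wildcard limits who may embed the page.
            return parts[1:] != ["*"]

    return False
```

Running this over the headers of each blog's front page would give a rough count of how many sites opt out of being framed.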


This is neat, thanks. I was looking at your HN OPML file on GitHub [0] and noticed that the `xmlURL` and `htmlURL` attributes are the same for each entry; the `htmlURL` currently points to the feed rather than the site. Do you happen to have the original HTML URLs available? Would be nice to have both.

(Secondarily, I'm guessing some of the `type` attributes should probably be "atom" rather than "rss"?)
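A quick way to tell the two apart is to look at the root element of each feed. This is only a sketch: `detect_feed_type` is a hypothetical helper, it expects the feed body as a string, and RSS 1.0 (RDF) feeds simply come back as "unknown" here:

```python
import xml.etree.ElementTree as ET

ATOM_FEED_TAG = "{http://www.w3.org/2005/Atom}feed"


def detect_feed_type(xml_text):
    """Guess the feed format from the document's root element."""
    root = ET.fromstring(xml_text)
    if root.tag == ATOM_FEED_TAG:
        return "atom"
    if root.tag == "rss":
        return "rss"
    # RSS 1.0 (RDF) and anything else ends up here.
    return "unknown"
```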

[0] https://github.com/outcoldman/hackernews-personal-blogs/blob...

> I was looking at your HN OPML file on GitHub [0] and noticed that the `xmlURL` and `htmlURL` attributes are the same for each entry; the `htmlURL` currently points to the feed rather than the site. Do you happen to have the original HTML URLs available? Would be nice to have both. (Secondarily, I'm guessing some of the `type` attributes should probably be "atom" rather than "rss"?)

I'm just using the file that someone else made, and I guess they didn't really distinguish between those URLs in the code. It shouldn't be too hard to modify, though: https://github.com/outcoldman/hackernews-personal-blogs/blob...

It is also true that there are RSS, Atom and possibly other feed types mixed in there. What I did for my site was to crawl through all of those feeds and process them one by one: get all of the posts, do some ordering and grouping and output everything as RSS feeds in a consistent format.
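Roughly, that kind of processing could look like the sketch below. It is not the actual code behind the site: the function names, the channel link and the limit are placeholders, it works on feed bodies that have already been downloaded, it only handles RSS 2.0 and Atom, and it assumes every date carries a timezone:

```python
import xml.etree.ElementTree as ET
from datetime import datetime
from email.utils import format_datetime, parsedate_to_datetime

ATOM = "{http://www.w3.org/2005/Atom}"


def parse_posts(xml_text):
    """Extract title, link and date for each post in an RSS 2.0 or Atom document."""
    root = ET.fromstring(xml_text)
    posts = []
    if root.tag == ATOM + "feed":
        for entry in root.iter(ATOM + "entry"):
            stamp = entry.findtext(ATOM + "published") or entry.findtext(ATOM + "updated")
            if not stamp:
                continue  # skip entries without a usable date
            link = ""
            for el in entry.findall(ATOM + "link"):
                # A link without rel is treated as rel="alternate".
                if el.get("rel", "alternate") == "alternate":
                    link = el.get("href", "")
                    break
            posts.append({
                "title": entry.findtext(ATOM + "title", default=""),
                "link": link,
                "date": datetime.fromisoformat(stamp.replace("Z", "+00:00")),
            })
    else:
        for item in root.iter("item"):
            stamp = item.findtext("pubDate")
            if not stamp:
                continue  # skip items without a usable date
            posts.append({
                "title": item.findtext("title", default=""),
                "link": item.findtext("link", default=""),
                "date": parsedate_to_datetime(stamp),
            })
    return posts


def merge_to_rss(feed_documents, title="Merged feed", limit=100):
    """Combine posts from several feeds into one RSS 2.0 document, newest first."""
    posts = [post for doc in feed_documents for post in parse_posts(doc)]
    posts.sort(key=lambda post: post["date"], reverse=True)

    rss = ET.Element("rss", version="2.0")
    channel = ET.SubElement(rss, "channel")
    ET.SubElement(channel, "title").text = title
    ET.SubElement(channel, "link").text = "https://example.invalid/"  # placeholder
    ET.SubElement(channel, "description").text = title
    for post in posts[:limit]:
        item = ET.SubElement(channel, "item")
        ET.SubElement(item, "title").text = post["title"]
        ET.SubElement(item, "link").text = post["link"]
        ET.SubElement(item, "pubDate").text = format_datetime(post["date"])
    return ET.tostring(rss, encoding="unicode")
```

Grouping (per year, per author and so on) would just be a filter on the `posts` list before the output step.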

For example, here's the top 100 user feeds for 2023: https://hn-blogs.kronis.dev/feed-top100.xml

Those have HTML links for each of the posts, though I'm afraid that's not exactly what you're asking for (the HTML URLs for the sites/feeds themselves), because I don't actually store those anywhere. The original feeds of the sites should have that information too, though.
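If someone wanted to recover the site URLs from the feeds, something like this might be a starting point. It is a sketch under a few assumptions: `site_url_from_feed` is a made-up name, the feed body is passed in as a string, and the feed is expected to follow the usual conventions (`<link>` inside `<channel>` for RSS 2.0, a feed-level `<link rel="alternate">` for Atom):

```python
import xml.etree.ElementTree as ET

ATOM = "{http://www.w3.org/2005/Atom}"


def site_url_from_feed(xml_text):
    """Return the site's HTML URL as declared by the feed, or None if absent."""
    root = ET.fromstring(xml_text)
    if root.tag == ATOM + "feed":
        # Only feed-level links are checked, not the ones inside entries.
        for el in root.findall(ATOM + "link"):
            # A link without rel is treated as rel="alternate".
            if el.get("rel", "alternate") == "alternate":
                return el.get("href")
        return None
    # RSS 2.0: the un-namespaced <link> directly under <channel>.
    return root.findtext("channel/link")
```

The result could then be written into the OPML entry next to the existing feed URL.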