Hacker News new | ask | show | jobs
by gwern 1543 days ago
Probably. HN is fairly plain HTML so Common Crawl should have no issue crawling it, and I'm not aware of any HN optout there (which would go against the usual public accessibility of everything on HN to APIs and projects etc), nor would any of the obvious data-filtering measures filter it out.