by johnm212 1891 days ago
> we scraped it all (at least as many as we could find on their English site) 12,118 questions

There are a lot more questions on Yahoo Answers. Their robots.txt file points to a sitemap: https://answers.yahoo.com/sitemaps/sitemap-us.xml

That sitemap contains links to 2,115 other sitemaps. Each of those sitemaps contains links to around 47k questions.
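For anyone who wants to sanity-check those numbers, here's a rough sketch of how walking the sitemap index and estimating the total might look. It assumes the files follow the standard sitemaps.org XML format (I haven't confirmed the exact layout of Yahoo's files), and the sample URLs and function names are made up for illustration.

```python
import xml.etree.ElementTree as ET

# Namespace from the sitemaps.org protocol; assumed (not verified) to be
# what the Yahoo Answers sitemap files use.
LOC_TAG = "{http://www.sitemaps.org/schemas/sitemap/0.9}loc"


def extract_locs(xml_text: str) -> list[str]:
    """Return every <loc> URL found in a sitemap or sitemap index document."""
    root = ET.fromstring(xml_text)
    return [loc.text.strip() for loc in root.iter(LOC_TAG) if loc.text]


def estimate_total(num_sitemaps: int, avg_per_sitemap: int) -> int:
    """Rough upper estimate: child sitemaps times average URLs per sitemap."""
    return num_sitemaps * avg_per_sitemap


# Hypothetical sitemap index, standing in for the real one.
SAMPLE_INDEX = """<sitemapindex xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <sitemap><loc>https://example.com/sitemaps/part-1.xml</loc></sitemap>
  <sitemap><loc>https://example.com/sitemaps/part-2.xml</loc></sitemap>
</sitemapindex>"""

if __name__ == "__main__":
    print(extract_locs(SAMPLE_INDEX))
    # Figures from the comment above: 2,115 sitemaps, around 47k questions each.
    print(estimate_total(2115, 47_000))
```

With the figures above, that works out to roughly 99 million questions, assuming the 47k average holds across all of the sitemaps. The same `extract_locs` helper works on both the index and the child sitemaps, since both list their URLs in `<loc>` elements.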

1 comment

Good find! Worth a follow-up to get more!