Hacker News
by kneel25 74 days ago
> a lot of websites

It was a dataset of the entire public internet from the very beginning, one that bypassed paywalls etc. There’s virtually nothing they haven’t scraped.