Hacker News new | ask | show | jobs
by zamadatix 19 days ago
Common Crawl doesn't publish raw DNS separately, you have to pull the information out of the aggregate database. The WARC-IP-Address header should record the IP Common Crawl connected to for the site.
1 comments

Good timing, I'm about to release that dataset.