Hacker News
zamadatix 19 days ago
Common Crawl doesn't publish raw DNS data separately; you have to pull the information out of the aggregate database. The WARC-IP-Address header should record the IP address Common Crawl connected to for the site.
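For anyone who wants to try this, here is a rough sketch of pulling hostname/IP pairs out of WARC record headers using only the Python standard library. The function names (`iter_warc_headers`, `host_ip_pairs`) and the sample record are made up for illustration, and the parser assumes an already-decompressed byte stream with well-formed `Content-Length` headers. It is not Common Crawl's own tooling.

```python
import io
from urllib.parse import urlsplit


def iter_warc_headers(stream):
    """Yield a dict of header fields for each record in an uncompressed WARC stream.

    Hypothetical helper: a minimal parser, not a full WARC implementation.
    """
    while True:
        line = stream.readline()
        if not line:
            return  # end of stream
        if not line.startswith(b"WARC/"):
            continue  # skip the blank lines that separate records
        headers = {}
        while True:
            line = stream.readline()
            if line in (b"\r\n", b"\n", b""):
                break  # a blank line ends the header block
            name, _, value = line.decode("utf-8", "replace").partition(":")
            headers[name.strip().lower()] = value.strip()
        # Consume the record body so the next iteration starts at the next record.
        stream.read(int(headers.get("content-length", 0)))
        yield headers


def host_ip_pairs(stream):
    """Yield (hostname, ip) for every response record that carries WARC-IP-Address."""
    for headers in iter_warc_headers(stream):
        if headers.get("warc-type") != "response":
            continue
        ip = headers.get("warc-ip-address")
        uri = headers.get("warc-target-uri")
        if ip and uri:
            yield urlsplit(uri).hostname, ip


# Invented sample record; 192.0.2.10 is a documentation-only address.
SAMPLE = (
    b"WARC/1.0\r\n"
    b"WARC-Type: response\r\n"
    b"WARC-Target-URI: http://example.com/\r\n"
    b"WARC-IP-Address: 192.0.2.10\r\n"
    b"Content-Length: 5\r\n"
    b"\r\n"
    b"hello"
    b"\r\n\r\n"
)

print(list(host_ip_pairs(io.BytesIO(SAMPLE))))
# prints [('example.com', '192.0.2.10')]
```

The crawl files are distributed gzip-compressed, so in practice you would probably pass something like `gzip.open(path, "rb")` instead of the `BytesIO` sample, or use a dedicated WARC library.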
ccgreg 19 days ago
Good timing, I'm about to release that dataset.