Hacker News new | ask | show | jobs
by rodion_89 4841 days ago
Their robots.txt file clearly asks to not be crawled at all.

  User-agent: *
  Disallow:
http://digg.com/robots.txt
2 comments

Incorrect. To configure your robots.txt to not be crawled at all use:

  User-agent: *
  Disallow: /
Allow indexing of everything with:

  User-agent: *
  Disallow:
It seems they are at this very moment struggling to change things.
You are totally right. Disregard everything I said.
Disallow nothing -> allow everything.