Hacker News
by
rodion_89
4841 days ago
Their robots.txt file clearly asks to not be crawled at all.
User-agent: *
Disallow:
http://digg.com/robots.txt
2 comments
blauwbilgorgel
4841 days ago
Incorrect. To configure your robots.txt to not be crawled at all use:
User-agent: *
Disallow: /
Allow crawling of everything with:
User-agent: *
Disallow:
It seems they are at this very moment struggling to change things.
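If you want to check the difference yourself, here is a minimal sketch using Python's standard-library urllib.robotparser. The helper name is_allowed and the URL http://digg.com/some/page are made up for illustration, and this only shows how one parser reads the two rule sets, not how any particular crawler behaves.

```python
from urllib.robotparser import RobotFileParser

# The two rule sets discussed above, written one directive per line.
ALLOW_ALL = "User-agent: *\nDisallow:"    # empty Disallow: nothing is blocked
BLOCK_ALL = "User-agent: *\nDisallow: /"  # "/" prefix matches every path


def is_allowed(robots_txt: str, url: str, agent: str = "*") -> bool:
    """Hypothetical helper: parse robots.txt text and ask if `agent` may fetch `url`."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    url = "http://digg.com/some/page"  # illustrative URL, not a real page
    print(is_allowed(ALLOW_ALL, url))  # True
    print(is_allowed(BLOCK_ALL, url))  # False
```

The only difference between the two files is the trailing slash, which is why they are easy to mix up.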
rodion_89
4839 days ago
You are totally right. Disregard everything I said.
chenster
4841 days ago
Disallow nothing -> allow everything.