by chinathrow 3895 days ago
Google also adheres to the robots.txt standard. Most of the scrapers I block don't.

Not correct. Google will ignore the rules in robots.txt entirely when it deems that acceptable. I think there's a link about this somewhere on this comment page.
Correct, they do not index the content of a disallowed page, but they might still add its URL. If a meta noindex tag is present, they won't index even the URL.
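
For anyone who wants to see what a well-behaved crawler is supposed to do with a Disallow rule, here is a minimal sketch using Python's standard `urllib.robotparser`. The rules, the `ExampleBot` user agent, the example.com URLs and the `is_crawl_allowed` helper are all made up for illustration. Note that this only covers crawling; the noindex directive lives in the page itself (`<meta name="robots" content="noindex">`), not in robots.txt.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content, for illustration only.
ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
"""


def is_crawl_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the given robots.txt text permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network request is made here.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for path in ("/private/page.html", "/public/page.html"):
        url = "https://example.com" + path
        print(path, is_crawl_allowed(ROBOTS_TXT, "ExampleBot", url))
```

A scraper that never runs a check like this is the kind being blocked in the parent comment. The check says nothing about whether the URL itself can still end up listed in an index.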