by maxrmk 82 days ago
It’s easy to opt out of being indexed by Google.

Exactly. Self-identifying crawlers like Google and Bing aren't the issue. They obey robots.txt and can easily be blocked by user-agent checks. Non-identifying crawlers, which present humanlike user agents and are usually distributed so that they get around IP-based rate limits, are the ones that are challenging to deal with.
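
To make the difference concrete, here is a rough sketch of the kind of user-agent check I mean. The function name and the token list are made up for illustration; a real deployment would do this in the web server or CDN config instead. The point is that the check only works when the crawler tells the truth about itself.

```python
# Hypothetical block list: substrings that self-identifying crawlers
# put in their User-Agent header (Googlebot and bingbot do this).
BLOCKED_TOKENS = ("googlebot", "bingbot")


def is_declared_crawler(user_agent: str) -> bool:
    """Return True if the User-Agent header names a crawler on the block list."""
    ua = user_agent.lower()
    return any(token in ua for token in BLOCKED_TOKENS)


# A crawler that identifies itself is trivially caught.
declared = "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"

# A scraper sending a browser-like header looks like any other visitor.
# This string is an illustrative example of such a header.
disguised = "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36"

print(is_declared_crawler(declared))   # caught by the check
print(is_declared_crawler(disguised))  # slips through
```

Nothing in the header distinguishes the second request from a real browser, and if those requests are spread across many IPs, a per-IP rate limit doesn't catch them either.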