Hacker News new | ask | show | jobs
by dudeinjapan 460 days ago
In practice, robots.txt is to control which pages appear in Google results, which is respected as a matter of courtesy, not legality. It doesn't prevent proxies etc. from accessing your sites.