Hacker News new | ask | show | jobs
by dutchbrit 1037 days ago
Just a thought that I have, wouldn’t it be better to block all robots and only to whitelist a select few? More AI bots are scraping now and in the future…
1 comments

Author here.

I wish I could, but I bet most would just ignore robots.txt.

Seconding. robots.txt is just a way of "asking nicely." If somebody wants to scrape, they're spoofing their UA and ignoring it. Can't do anything other than monitor the logs and ban IPs one by one.