Hacker News new | ask | show | jobs
by rideontime 820 days ago
How would you propose doing so?
3 comments

Off the top of my head:

- Cloudflare

- Webserver-level user-agent blocking (Apache, nginx)

- Application-level user-agent blocking (`if request.user_agent == 'OpenAI'`)

None of them are ideal since you can simply change your user agent, but all of them seem like better options than robots.txt to me.

We could repurpose the evil bit.
One second, let me google this.

e: Okay, this is funny.

Web servers can check the user-agent and block the request.

E.g. nginx $http_user_agent