| Any analogy is flawed and you can kill most analogies very fast. They are meant to illustrate a point hopefully efficiently, not to be mathematically true. They are not to everyone's taste, me included in most cases. They are mostly fine as long as they are not used to make a point, but only to illustrate it. I agree with this criticism of this analogy, I actually had this flaw in mind from the start. There are other flaws I have in mind as well. I have developed more without the analogy in the remaining of the comment. How about we focus on the crux of the matter? > A web server that accepts a GET request and replies 2xx is distinctly NOT "locked" in any way The point is that these scrappers use tricks so that it's difficult not to grant them access. What is unreasonable here is to think that 200 means consent, especially knowing about the tricks. Edit: > you're more than welcome to put an authentication gate around your content. I don't want to. Adding auth so llm providers don't abuse my servers and the work I meant to share publicly is not a working solution. |
As the web server operator, you can try to figure out if there's a human behind the IP, and you might be right or wrong. You can try to figure out if it's a web browser, or if it's someone typing in curl from a command line, or if it's a massively parallel automated system, and you might be right or wrong. You can try to guess what country the IP is in, and you might be right or wrong. But if you really want to actually limit access to the content, you shouldn't be publishing that content publicly.