"For example, if a user specifically asks for a URL's full text, it might inadvertently fulfill this request." is even clearer. Like, in what cases would you not want to fulfill that request?
How do you suggest website owners do that? By becoming aware, after the fact, that they can disallow OpenAI's crawler in their robots.txt file, and hoping for the best in the next model update? By adding a paywall? By not publishing at all?