| As presented, the question falls under the category of "P vs NP"[2][3]. Historically, the answer was sneakernet, a physical or limited-access personal library, or a "not for distribution outside the company" policy. The simplest approach is a private, internal network with no outside internet access. That doesn't prevent sneakernet transfers to machines that do have outside access, nor does it prevent an LLM on a USB stick from 'scanning', or unintentional 'picture uploads'. How would one distinguish LLM scanning from non-LLM scanning (beyond something like 10,000,000 requests per second from a single source)? Checking a site's robots.txt is on the honor system. Related mechanisms that rely on a specific way to identify valid versus invalid access, such as fail2ban, are a never-ending battle of updates and revisions to stay current. License or no license, this is sort of a different take on the Turing test, i.e. whether an AI can fool a human into believing it is human[0]. A CAPTCHA system[1], used to verify that a visitor is not a bot, is an example of this. [0] : https://en.wikipedia.org/wiki/Turing_test [1] : https://en.wikipedia.org/wiki/CAPTCHA [2] : https://en.wikipedia.org/wiki/P_versus_NP_problem [3] : https://news.mit.edu/2009/explainer-pnp |
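To make the robots.txt honor-system point concrete, here is a minimal sketch using Python's standard `urllib.robotparser`. The robots.txt contents, the `ExampleLLMBot` user agent, the `is_allowed` helper, and the example.com URLs are all made up for illustration. The check runs entirely on the crawler's side and trusts whatever user-agent string the crawler chooses to report, so a bot that skips the check or claims to be a browser is not stopped by it.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: blocks one named bot everywhere,
# and blocks everyone else only under /private/.
ROBOTS_TXT = """\
User-agent: ExampleLLMBot
Disallow: /

User-agent: *
Disallow: /private/
"""


def is_allowed(user_agent, url, robots_txt=ROBOTS_TXT):
    """Return True if robots_txt permits user_agent to fetch url.

    This is a voluntary, client-side check: nothing on the server
    enforces the answer.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


page = "https://example.com/docs/page"

# A crawler that honestly identifies itself is told to stay out.
print(is_allowed("ExampleLLMBot", page))

# The same crawler reporting a browser-like user agent is allowed.
print(is_allowed("Mozilla/5.0", page))
```

The two calls differ only in the self-reported user agent, which is why robots.txt works as a request rather than an access control.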