| To stop this, I a month or two ago put most of my Amazon Redshift research web-site behind a basic auth username/password wall. It all remains free, but you need to email me for a username and password. If I put in time and effort to make content and OpenAI et al copy it and sell it through their LLM such that no one comes to me any more, then plainly it makes no sense for me to create that content; and then it would not exist for OpenAI to take, or for anyone else to read. We all lose. It seems parasitic, and on the face of it, acting to kill the host. In fact, it essentially seems like abrogation of the concept of private property. The AI companies can take what I make, without my consent, which they sell for profit, where that profit it seems to be was formerly substantially coming to me, the return on my efforts. I had a look for ways to indicate to AI companies to remove my content. The methods provide are a fig leaf and put the onus on me, and in any event in a way which can never be known to have removed my content - "if you can show your content from a prompt, we will take steps to try to prevent that content from showing". As a consequence of putting up a username/password wall, Google has profoundly de-ranked the site, and I believe it is basically not being found on search any more. |
As you mentioned, they know they need good data though, so they might actually try to find some equilibrium.
If not, it's possible that the creation of new valuable content, to feed the LLMs, will be produced in-house by the AI labs. Sounds insane, but Netflix also makes their own content.
I think the AI labs will become so big that they'll take on more roles than just offering LLM inference. I think they'll become as or more powerful than many current nation state governments.