| You can't stop people from polluting the web with ai-generated outputs (and therefore contaminating data sets you hope to be able to be able to assume are human-generated) until you create a humanweb (fuck the 'web3' attempts we've had so far, web3 ought to be human-verified vs non-human-verified web) that has real, effective human-verification on inputs built-in. The regular web will still be useful but for an increasing number of applications you'll need to go to the humanweb to get what you need where self-feeding hallucinations and sloppy modelpaste isn't everywhere. If people are mad that Twitter gives a megaphone to everyone, including the ignorant masses, then they'll love the auto-spam that LLMs are going to create. Anything that can be said will be said. You want a Reddit with humans? Ha. On the regular non-human-verified discussion platforms of tomorrow, you'll be lucky if 4% of the comments you are replying to and arguing with even have a human on the other end, but the good news is the rebuttal comment you posted after having too much coffee will be ingested and used for training of the next version of the model you're arguing with. So your original content human-input may be parroted much more broadly than it would've been on the pre-LLM web. If LLM spam really does flourish and spread misinfo and hallucinations everywhere and we don't develop good automated means to prevent it or to verify content, it may be necessary for a central authority/business to maintain hardware terminals at distributed, centralized locations for interacting with the humanweb that you can't install or control the software on and where a human or a camera is watching you physically type on the keyboard to make sure you aren't just automating the inputs physically with some software->machine->keyboard interface or connecting some virtual keyboard. Think a locked-down public library computer but you're watched while you interact with it, and they're deployed and administered across the planet by a trusted multinational for sensitive usages where you absolutely need to ensure the inputs are from humans. You wanna get real fun and cyberpunk novel thought-experimenty, picture prison-like security, physical pat-downs or even a requirement that you use the terminal naked and are body-searched for devices. Maybe x-ray scanned for implanted hardware. Of course the whole thing falls apart if the trusted authority that administers the hardware is compromised but at least you stop some of the non-state actors and script kiddies. |