Hacker News new | ask | show | jobs
by a13o 1252 days ago
On the topic of Content is King, I have a different view than the author. I think in the case of these trained AIs, 'content' refers to the training datasets and not the generated outputs.

Trained AIs are in something like the early digital streaming days where there was only one provider in town, so that provider aggregated All The Content. Over the next decade we would see the content owners claw their content back from Netflix, and onto competitor platforms -- which takes us to where we are today. Netflix's third party content has dwindled and forced them to focus on creating their own first party content which can not be clawed away.

When these generative AIs start to produce income, it will be at the expense of the artists whose art was in the training dataset nonconsensually. This triggers the same content clawback we saw in digital streaming. Training datasets will be heavily scrutinized and monetized because the algorithms powering generative AIs aren't actually carrying much water. What is DALL-E without its dataset? Content is King.