Hacker News new | ask | show | jobs
by spacechild1 1 day ago
IMO the author absolutely nails it. Yes, it might not be the most nuanced take, but they express what many of us are thinking.

> Google and Yahoo and even DDG have been using copyrighted data for a quarter century, if not longer.

Your argument boils down to: companies haven't respected copyright or user consent for a long time, so why should we now care?

The answer is: it was wrong back then and it is still wrong now, but on a completely different scale! We have reached a tipping point where many people are (rightfully) furious.

> Consent was gone when Google, Yahoo, etc. started indexing the entire internet.

People do not have a problem with Google indexing the web because they want their websites to be found. That's the entire purpose of putting stuff out on the internet. AI is now doing the opposite: traffic is directed to chatbots and platforms are flooded with AI slop.

As you said, people were already complaining when Google started to show their own summaries, preventing users from visiting the original sources. People have been complaining about Facebook drowning the frontpage with clickbait. But this time it is so much worse.

Sure, LLMs can be useful, but on the other hand the potential for abuse is humongous. People are already seeing the negative consequences and they are rightfully angry.

> Come up with an interesting point or thought-provoking counter-argument and maybe people will take you seriously.

I do think that the author made a good point by bringing up the issue of consent. We are now in a situation where I can't post anything online without AI companies possibly scraping the content and feeding it into their models without any attribution or permission. And there is no way of opting out. Don't you see a problem with that? Why should we not care?

1 comments

> Your argument boils down to: companies haven't respected copyright or user consent for a long time, so why should we now care?

This is not a very charitable take. My argument is: that cat is out of the bag, so what do we do next? Do you have a plan? Ranting about consent or privacy or copyright won't somehow untrain these models. At least the math people put out the pretty sensible Leiden Declaration[1].

> Don't you see a problem with that? Why should we not care?

This is a lot like saying "do you not see a problem with nuclear weapons? They can wipe out humanity!"—I mean, yeah, sure I do, but ranting about the technology won't somehow put that cat back in the bag, either.

[1] https://leidendeclaration.ai/

> My argument is: that cat is out of the bag, so what do we do next?

I did not see this argument in your first post at all. Instead you tried to dismiss the author's complaints, arguing that none of their complaints are new. (Which is true in a sense, but it ignores the most important point: that of scale)

> I mean, yeah, sure I do, but ranting about the technology won't somehow put that cat back in the bag, either.

Fair enough, the author did not propose any concrete solutions. However, their analysis is still correct. The first step towards change is to acknowledge that we have a problem. Your first reply did not seem to do that, on the contrary, it seemed to downplay the authors concerns. Maybe that was not your intention, but that's how I read it.

I certainly agree that we need to find solutions. Yes, the cat is out of the bag, but with proper legal frameworks and regulations we can keep it from pissing all over the living room.

> Ranting about consent or privacy or copyright won't somehow untrain these models.

On the other hand pointed, organised, public dissatisfaction and upheaval could lead to legislation requiring companies caught in the act of theft to fork over their profits to the state to be used for public good. Think about it :)