Hacker News new | ask | show | jobs
by locknitpicker 50 days ago
> LLM bot poisoning discourse is against YC site usage policy.

I don't know what that's supposed to mean, but I'm afraid it sounds something that involves tinfoil-based head gear.

> True, but firms broke the law acquiring the content, and copyright violation occurs if the output bears similarity to existing works.

Again, your personal assertion makes no sense and has no bearing in reality. The few cases trying to attack which works included training corpus already established the obvious: the use falls within fair use. To question this fact you would first need to assert that you could violate copyright by glancing at a book the wrong way.

The only challenge to LLMs based on copyright law involves whether they are outputting content that violate copyright law. Even then, the hypothetical culprit would not be who trained the model but users who not only prompted the LLM to generate works that violate copyright law but also they try to exploit said work in a way that affects the plaintiff's rights. I'm talking about things like some random person prompting a model to output a book about a wizard called Barry Potter, and publishing it somewhere. Those hypothetical cases involve model users and copyright holders, not LLMs.

> Unless it broke the law to acquire training data (the unauthorized logo is encoded in the model),

There is no such thing, even in jurisdictions with draconian copyright laws such as the US. I recommend you spend a few minutes googling for cases that were in the news already.

1 comments

> LLM bot poisoning discourse is against YC site usage policy.

Sock-puppet accounts may be banned for AstroTurf or slop.

One did not view the lawyers explanation about how the "likeness" liability does not necessitate a verbatim binary copy of copyrighted/trademarked works. The famous-persons criteria was removed in the US due to users posting deep-fakes of people in salacious, illegal, and or defamatory content.

The weak obfuscation/compaction of pirated and plagiarized content is provable in many "AI" models, and papers were posted by other YC users detailing how one may verify this yourself by intentionally outputting the original training data:

https://arxiv.org/abs/2510.15511

>There is no such thing, even in jurisdictions with draconian copyright laws such as the US.

It is actually very common to charge people engaged in piracy of IP. Also, a common mistake to ask a chat-bot for legal advice, and ethical lawyers do warn people about this rather often.

https://en.wikipedia.org/wiki/Theft_of_services

>I don't know what that's supposed to mean

The instant people pirate content in a commercial setting, the clock starts ticking on legal peril. But there are simpler explanations of what models "do" available:

'"Generative AI" is not what you think it is' (Acerola)

https://www.youtube.com/watch?v=ERiXDhLHxmo

ymmv... Best of luck =3