| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by noogle 1189 days ago
	The current (legal) answer is "unclear". There are indications that training is fine, but producing and using the generated content is questionable at least. As many IP issues, it will solved only when someone will try that in court and go all the way until a verdict. Some cases are actually being processed but it might take years to get an answer.

1 comments

sampo 1189 days ago

> The general problem of "AI"s being trained on copyrighted content

> The current (legal) answer is "unclear".

European Union was ahead of times for once. The 2019 copyright directive, article 4, makes it legal to scrape the web and make and keep local copies of copyrighted works, for data mining purposes. Unless the copyright holders set up a machine readable exception (such as robots.txt file).

So legal in EU, "unclear" in US.

link

pklausler 1189 days ago

That does not, to me, automatically imply that an "AI" lawfully regurgitating copyrighted content is a "data mining purpose".

link

News-Dog 1189 days ago

Consider that an AI may cite many snippets of copyright publications into a chimera of 'Facts'.

'copyright fair use' : https://copyrightalliance.org/faqs/what-is-fair-use/

link

EamonnMR 1189 days ago

Does OpenAI respect Robots.txt? Do we know?

link