Hacker News new | ask | show | jobs
by sampo 1188 days ago
> The general problem of "AI"s being trained on copyrighted content

> The current (legal) answer is "unclear".

European Union was ahead of times for once. The 2019 copyright directive, article 4, makes it legal to scrape the web and make and keep local copies of copyrighted works, for data mining purposes. Unless the copyright holders set up a machine readable exception (such as robots.txt file).

So legal in EU, "unclear" in US.

2 comments

That does not, to me, automatically imply that an "AI" lawfully regurgitating copyrighted content is a "data mining purpose".
Consider that an AI may cite many snippets of copyright publications into a chimera of 'Facts'.

'copyright fair use' : https://copyrightalliance.org/faqs/what-is-fair-use/

Does OpenAI respect Robots.txt? Do we know?