by dec0dedab0de 1862 days ago
That's fine for training your own model, but I don't think you could distribute the training set. That seems like a clear copyright violation, against one of the groups that cares most about copyright.

I'm not sure that is a clear copyright violation. Sure, at a glance it seems like a derivative work, but it may be altered enough that it is not. I believe that collages and reference guides like CliffsNotes are both legal.

I think a bigger problem would be that the scripts, and even the closed captioning, rarely match the recorded audio 100%.

And also... it's not like the program actually contains a copy of the training data, right? The training data is a tool which is used to build a model.