|
|
|
|
|
by Eridrus
907 days ago
|
|
Exactly; there is no need to do this in the model, you just need well understood token retrieval methods for identifying copyright infringement that ChatGPT's competitors already have. You will get into some murky definitions of what is exactly required for copyright infringement vs fair use, etc, but we already do this for ContentId for YouTube and text is far simpler. |
|
What if I write a story and publish it on my blog. Should I be required to submit this to openAI's copywrite model to ensure the story is never used in openAIs other models? What about the other 100 AI model companies that are going to spring up in the next year?
It should be on the curators of the training set to ensure all material inside is fair for them to use.