| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by Eridrus 907 days ago
	Exactly; there is no need to do this in the model, you just need well understood token retrieval methods for identifying copyright infringement that ChatGPT's competitors already have. You will get into some murky definitions of what is exactly required for copyright infringement vs fair use, etc, but we already do this for ContentId for YouTube and text is far simpler.

1 comments

noitpmeder 906 days ago

This is bogus. Now you require that every piece of copywriter be registered and indexed in a central authority?

What if I write a story and publish it on my blog. Should I be required to submit this to openAI's copywrite model to ensure the story is never used in openAIs other models? What about the other 100 AI model companies that are going to spring up in the next year?

It should be on the curators of the training set to ensure all material inside is fair for them to use.

link