Hacker News new | ask | show | jobs
by visarga 884 days ago
I think copyright lawsuits against AI companies will force them to develop attribution models. They will do the work of indexing all ideas to their authors. This will also reveal what is common knowledge, and who borrowed from who without attribution.

In order to make attribution models we need text+author+timestamp. We can get that from books, newspaper articles, scientific papers and social network posts. Then we extend to the rest of the training set.

But then we can also make AI models that cleverly avoid infringement while the same strict checking is going to be applied to human made content. Humans are not that good at avoiding pitfalls.