Hacker News
by cocktailpeanuts 1801 days ago
How would you know which stock photos were used for training? Someone could take a whole bunch of images they bought on a dark market, train a huge model on them, and dump it on the public internet over torrent or something. And there would be no way to know which images were used to train the model.
1 comment

Perhaps as this technique becomes more widely used, there will be regulations on the source dataset, such as a requirement to prove that you own all of it (or that it is in the public domain).