|
|
|
|
|
by Natsu
5270 days ago
|
|
The information about how old a person looks exists in the image itself. You can compare the size and proportions of a person's features to known examples. No one is going to mistake a four-year-old for a grandmother. The information about whether or not someone has permission to copy something does not exist in the file: the infringing copy and the legitimate copy are bit-for-bit identical. You have to get the data on who has permission to do what from somewhere else. The Viacom v. YouTube case proved that even expensive lawyers spending hours doing research can't manage to get even 99% accuracy. They had to remove things that Viacom itself had uploaded (or authorized to be uploaded) from that case. Twice. Now suppose we have a magic system that's better than expensive copyright lawyers that can manage 99% accuracy (which they didn't) and that we operate at internet scale on the roughly 60 billion web pages Google indexes. Then assume that no less than 20% of the internet is pirated material. Does that sound too high? Good. Your numbers will look even worse if it's lower. Now we do a little math: 0.2 * 0.99 = 19.8% are pirate pages correctly blocked
0.8 * 0.99 = 79.2% are innocent pages correctly allowed
0.2 * 0.01 = 0.2% are pirate pages mistakenly allowed
0.8 * 0.01 = 0.8% are innocent pages mistakenly blocked
So we've just censored four innocent pages for every one containing pirated material and there would be 120,000,000 pirate pages (0.2% of 60 billion) out there that aren't blocked. Feel free to work out the math, but if you want to say that 20% is too high a proportion, you'll have even more false positives, so even more innocent pages are censored for every pirate blocked.And this assumes that we have something better than ~$300/hr copyright lawyers screening all 60 billion web pages. Any computer program we make won't even be this good. |
|
No one is going to mistake a four-year-old for a grandmother.
True. But plenty of people will mistake a 14, 15, 16, or 17-year old for an 18-year old.
The Viacom v. YouTube case proved that even expensive lawyers spending hours doing research can't manage to get even 99% accuracy.
Again true, but the odds of a randomly selected uploaded full-length movie being illegally uploaded is, I bet, significantly larger than the odds of a randomly selected explicit movie featuring people of illegal age. So the prior coming from what fraction of available content out there is illegal according to the two standards would likely lead to a larger false positive rate for pornography.