Hacker News new | ask | show | jobs
by palmy 3207 days ago
This is the part of his explanation I find confusing too, though I do agree with his general argument: the results aren't really useful when they haven't been tested on a dataset representative of the real distribution.

From what I can gather he's basically saying that the "probability of a random face being classified as homosexual" is 0.5. This isn't REALLY true (would have to run the classifier on all possible faces to find this), but that is in fact the "environment" the classifier has been trained within.

1 comments

If the test set of images really were 50/50 and the human judges weren't told that, then they were effectively given inaccurate priors, which would obviously reduce their accuracy.