|
This is a really good question, one that we've discussed amongst at our university (a lot of researchers at Carnegie Mellon). To be useful, ML systems have to have some kind of bias. However, the distinction here is that some of these biases are harmful biases. Kate Crawford talks about allocative harms (how resources are allocated) and representation harms (e.g. stereotypes). Some of these harmful biases are really blatant. For example, labeling Blacks as "Gorillas" is offensive for many reasons. Some of these biases "correct" but that's due to biases in the data set or society. The ProPublica investigation of recidivism prediction is a good example, where it was more likely to say that Blacks should not be released. However, police are also more likely to arrest Blacks, which naturally leads to this bias. Other examples here include Amazon's resume system that was biased against women (since they used Amazon's hiring practices as ground truth), and image search for "professional hairstyles" that showed White women but "unprofessional hairstyles" that showed Black women. Other biases are also "correct" but greatly miss the underlying context. For example, a naive AI system might tell you don't go to a certain medical doctor that is a professor, since they have a higher rate of deaths. However, this doctor might also be a doctor of last resort, hence the high mortality rate. What I'm trying to get to is that even the term "correct" has a lot of subtleties to it. In many cases, figuring out what is "correct" (or ground truth in ML terms) can be a clash of values and world view, and might have different results based on differences in race, gender, age, culture, context, and power. |
Given enough data, and no doubt Amazon surveils their people more than most, they could determine the 'truth' along a more straightforward line.
"Does this hair style make more money for the company"
As hair can be a strong form of expression, there's probably a measurable delta here.
Going forward, smart companies will obfuscate the determination. I suppose that training an AI is not a bad way to pull this off.