| I'm going to be controversial here, and say that its possible that light colored skin is easier to identify from background, giving higher confidence to "is face" detection with lower false positives. Brown is, next to green, the most common color in nature. Lighter color also has a greater contrast with the background when using camera flash or other subject lighting. To be really blunt, I think white faces are easier to see. A common example in nature, white tailed deer. When tail is raised, their white fur is a clear danger signal because it stands out from the background. Like most mammals, the rest of their body is a shade of brown. I don't think its racist to suggest that light skin, just like blue eyes, is a conspicuous highly visible signal suggesting a human face. I find it hard to believe that such a large number of facial recognition models have an easier time identifying white people after years of publicity solely because of biased training data. The ultimate example is probably blue eyes. Almost no other animal has blue eyes. Blue is extremely rare in nature. I want to see a study on facial recognition, blue eyes vs other colors. I bet money you will find that models excel at recognizing human faces with blue eyes above all other factors |
Imagine someone makes a diverse facial recognition model where the confidence for detecting black faces are usually 99.96% even in rough lighting and 99.99% for white faces. They may have an acceptance bar at 95% so it's well within tolerance.
Combine that with an auto-cropping algorithm that takes a image, does computer vision on it, and selects the object that has the highest confidence and fits within the crop window.
When tested on both, portraits of white people and black people, it would pass, but in the examples, it falls down.
I say all of this not to excuse Twitter -- I still think they need to rethink how their autocrop works and fix it, but I don't think people these days do any type of people recognition without thinking of diversity.