|
|
|
|
|
by mike_hearn
400 days ago
|
|
Surely some of the model bias comes from targeting benchmarks like this one. It takes left-wing views as axiomatically correct and then classifies any deviation from them as harmful. For example, if the model correctly understands the true gender ratios in various professions it's declared to be a "stereotype" and that the model should be fixed to reduce harm. I'm not saying any specific lab does use your benchmark as a training target, but it wouldn't be surprising if they either did or had built similar in house benchmarks. Using them as a target will always yield strong biases against groups the left dislikes, such as men. |
|
This is painting with such a broad brush that it's hard to take seriously. "Models should not be biased toward a particular race, sex, gender, gender expression, or creed" is actually a right-wing view. It's a line that appears often in Republican legislation. And when your model has an innate bias attempting to correct that seems like it would be a right-wing position. Such corrections may be imperfect and swing the other way but that's a bug in the implementation not a condemnation of the aim.