| Aha - I think I see our miscommunication. When you say bias you mean statistical bias. Yes, machine learning is able to correct for that kind of bias - 538's polls forecast is a good example of that. But you don't get to redefine racial bias to be something innocuous. Yes, black names are more likely to have arrest records, but that "fact" is super misleading [1]. Finally, you're talking past me. I'm not saying that statistics is broken. I'm saying that we should be especially mindful of the OPs point when they say this: > So what’s your data being fried in? These algorithms train on large collections that you know nothing about. Sites like Google operate on a scale hundreds of times bigger than anything in the humanities. Any irregularities in that training data end up infused into in the classifier. I think the OP author also has a related post about the kind of bias I'm talking about: http://idlewords.com/talks/sase_panel.htm [1]: http://www.huffingtonpost.com/kim-farbota/black-crime-rates-... |
You are saying that algorithms are accurately measuring a reality you wish were different. I don't disagree with this.
The right thing to do is to actually answer unpleasant moral questions like "if blacks are 4x more likely to be dangerous criminals, what should we do about it?" But I guess overloading the word "bias" is a nice substitute for clearly thinking things through.