| Machine learning does not have less bias than human researchers. It is simply magnified at scale. And that scale is exactly the state of the internet. There is so much data available to study and understand, that we absolutely need better tools, like machine learning or whatever we want to call it, to help us keep up. Shit's moving faster than our human perception can handle, especially for those who didn't grow up with the internet. Yes the data analyctic tools we have right now are premature— like fast food to our productized minds— but they will improve rapidly, as our taste for quality improves. But sure demonizing the things you don't like is one step on the path to learning what's truly valuable. |
My go-to example is machine learning police enforcement direction, often used as a counter to racially biased policing. This works in any city with a historical problem of racial bias in policework. We give the algorithm all the data we have from the last 60 years of policing this city. Patrol schedules, incident records, arrest records... everything. The computer magically tells us where we should focus our efforts. To the police chief who paid for the system, and especially to the media reporting on it, it looks like a computer is making the decisions without bias. Hooray!
Of course, anyone who's ever worked with machine learning can spot the problem. The data set was generated by racially biased policing. That bias will be reflected in all the records: more arrests for race X, more patrols scheduled through their neighborhoods, more incident reports from those areas. So when the algorithm says "increase patrols in this neighborhood," or "look for people who fit this profile," it is simply synthesizing the patterns from 60 years of racial bias. So the police in LA have a real problem: their "unbiased" computer program is telling them that their criminals look like black people, and they should increase patrols in Compton. So they do, and that data only takes the data further from "un-biased" reality. In fact, the police "black box" is only pointing out a history of racially biased policing. We're relabeling it as recommendations for future behavior.