|
|
|
|
|
by joiguru
1757 days ago
|
|
The basic idea is as follows. Lets say you are building an ML model to decide whether to give someone insurance or not.
Lets also assume your past behavior had some bias (say against some group).
Now ML model trained on this past data will likely learn that bias. Part of modern ML focus is then to understand what bias exists in data, and how can we train models to use the data but somehow counteract that bias. |
|
This seems like a hard problem. For example, say that you have an ML model that decides whether someone will be a good sports athlete or not purely based on biometrics (blood oxygen level, blood pressure, BMI, reflex time, etc.). If the model starts predicting black people will be better athletes at higher rates than white people, is the ML model biased? Or is the reality that black people have higher-than-average advantageous physical characteristics? How do you tell the difference between bias and reality?