Hacker News new | ask | show | jobs
by catalogia 2445 days ago
Is it so hard for you to believe there might plausibly be a causation?

Consider the case of African Americans who are discriminated against by traffic cops. Is it plausible that African Americans, in an attempt [perhaps in vain] to minimize interaction with traffic cops, are more cognizant of traffic laws and drive more conservatively than the average American? I don't know if the data supports that hypothetical, but it seems plausible to me.

Assuming that this were the case, if you were to assume that African Americans drove as well as white Americans, you would be discriminating against the African American population by failing to recognize their safer driving habits.

2 comments

Whether you or I think there is a causation in specific cases is irrelevant, as is whether we apply charged terms like "racism" to certain causal linkages.

The point is that one is not compelled to believe in the causal link just because there is a statistical link.

So if certain causal links are politically contentious, rejecting them due to "political correctness" is completely separate from rejecting the facts, the statistics that are collected. It is political, but not in opposition to reality.

The article, as I understood it, is puncturing the assertion of objectivity by those who implicitly assert we have to regard all correlations as equally causal or else be against reason and logic.

I certainly do not believe anybody should be compelled to assume a correlation is a causation. However I also do not think one should preemptively rule out the possibility of causation. Without examining the nitty gritty details of any particular situation, we can't know which is the case. We certainly cannot assume one and rule out the other, which I fear is what you assumed I was doing.
The elephant in the room is that the real way to tell whether that is the case would be to use race as a factor the same as age or sex. If African Americans are more careful drivers then that would detect it and take it into account.

But then you have to take the bad with the good. If it turns out that strict adherence to traffic laws that nobody else abides is actually more dangerous than following the normal flow of traffic, it would also detect that and take it into account.

Well it may be the case that they accidentally have a proxy for race already in their data (the "this ethnicity prefers red cars" hypothetical in the article above.) So race may already in practice be factored in despite nobody intending for that to be the case (assuming nobody anticipated that a particular metric is a racial proxy.) That does not necessarily mean it's being unfair to that race though. It could, hypothetically, mean that it's actually being fair to that race, advantaging them in a system/society that would otherwise disadvantage them.
The whole "proxy for race" thing is such a mess.

The original problem was that racists were not just taking race into account but were disproportionately penalizing certain races. They would literally just refuse to do business with black people. And then once that was made illegal, they would refuse to do business with people from black neighborhoods (redlining), i.e. use location as a proxy for race so they could continue to refuse to do business with black people under that pretext.

Normal Bayesian statistics doesn't do that because it's missing the actually racist piece of it, which is giving disproportionate weight to race (or something that correlates with race) so that you refuse disproportionately many people of a particular race for no legitimate reason.

The unfairness never came from taking into account some factor that correlates with race, or even race itself to the extent that it actually correlates with outcomes. It came from using a factor to deny service even though it didn't correlate with outcomes, or if it did then still not proportionately to the huge negative weight assigned to it. It came from giving race, or a proxy for race, disproportionate weight. Giving it proportionate weight isn't unfair, it's the only thing that is fair.

I fully agree and admire the skill with which you've stated this.
"then that would detect it and take it into account."

You have a method for automatically deriving causal relationships from correlational data?

A lack of a causal relationship wouldn't matter in that case. If something correlates with the outcome then it allows you to better predict the outcome even if it isn't the cause, because it at least correlates with the cause or it wouldn't correlate with the outcome.

Though obviously if it isn't the cause then you're better off taking into account the true cause rather than only the thing that correlates with it -- which would cause the correlation with the outcome to disappear for the non-causal factor when you take into account both.