|
|
|
|
|
by alexssung
3388 days ago
|
|
I've never heard of death by a thousand control variables. What's exactly wrong with data controlled by relevant variables such as education, hours worked, and experience? If you find a combination of variables that reject the hypothesis, maybe you should look further into it, no? Dismissing it because you think it's not objective for some absurd reason is rather unscientific. "Trying all possible combinations of variables" does not disqualify an analysis from being objective--not sure where you got that idea from. |
|
What's not okay is sequentially trying all of the possible analyses and then stopping the moment you find the exact combination of variables that tells you what you had already assumed to be true. Especially so when simple analyses point to A, and you keep adding new variables until you get to B, which is exactly the case here. That is a very well known abuse of statistics. There is a reason all the well known and popular Information Criterions (which measure model quality) are parameterized by the number of parameters in the model.
And while adding control variables isn't per se bad, there are proper precautions to take when doing so, which become exponentially more costly the more you add. Such as segmented sampling, non-linearity transformations, and even controlled experiments. Because these fraudsters have a motive, the model only needs to be as rigorous as necessary to secure their predetermined conclusion. The "keep adding variables" model almost always ends up as a way to lie with statistics.