|
|
|
|
|
by lordnacho
146 days ago
|
|
It seems the criticism is indeed Berkson's Paradox, but the example is different to the canonical example of Berkson's paradox. In the canonical example, you have uncorrelated attributes, eg skill and attractiveness in actors, forming a round scatter plot with no correlation. Selecting a subpopulation of top actors who are either skilled or attractive, you get a negative correlation. You can visualize this as chopping the top-right of the round scatter plot off: the chopped off piece is oriented in roughly a line of negative correlation. In this example, if you look in the linked paper inside the post by Dimakis, there is a positively correlated scatter plot: You can tell the shape is correlated positively between youth and adult performance. But in this case, if you condition on the extremes of performance, you end up selecting a cloud of points that has flat to slight negative correlation. |
|
Uncorrelated attributes:
Looking at just the Included points shows clear (spurious) negative correlation.Correlated attributes:
The Included points still have a negative spurious correlation, though it's smaller than for the uncorrelated cartoon.