Hacker News new | ask | show | jobs
by an_d_rew 850 days ago
The problem here is that the underlying and unknown correlations in the sample (aka people) is that they are NOT independent.

The larger sample sizes are a ... sort-of "proxy" ... to overwhelm underlying latent correlations.

The whole thing is actually a subtle sort-of generalization of the "Prosecutor's Fallacy".

So skepticism with the small sample sizes is absolutely warranted, unless some strong evidence is shown indicating mechanism-based independence.

1 comments

I feel like you’re taking a very roundabout way to describe the concept of sample bias. So long as it’s a random sample, this is accounted for in the statistics. Yes, we should still get more data all the same.