|
|
|
|
|
by AstralStorm
397 days ago
|
|
The practice you describe is called data dredging though. The thing about it is that you do not know enough experimental design details to make sure it was all on the up, especially worse the older the dataset gets. Normally when doing that you need a multiple comparison corrections and conservative stats. That won't get you published though, or if you do get published you won't get noticed except by someone running a meta analysis. Perhaps not even then.
Usually you end up with negative results from reanalysis, evidence of tampering or small effect sizes. And this does not that reliably detect dataset manipulation, p hacking on the part of experimenters or accidental violations of the protocol, not even necessarily if the data collection included measures to prevent it. In short: you cannot 100% trust any dataset you did not make. Not even as part of the team that makes it. |
|