| >they only test null hypothesis that are true. If a null hypothesis is invariably true, it's impossible to reject it. Which means the scientists will not be able to find any statistic or data to support any of their bad, original hypotheses. Not 5%, not 0.005%, nor whatever. p-values are not flawed. They are a useful tool for a certain category of jobs: namely to check how likely your sample is, given a certain hypothesis. The argument in the original post is a bit of a straw man fallacy. "I want to know the probability that the null is true given that an observed effect is significant. We can call this probability "p(null | significant effect)" OK, hypothesis testing can't answer this type of questions. Then "However, what NHST actually tells me is the probability that I will get a significant effect if the null is true. We can call this probability "p(significant effect | null)"." Not quite correct. It's "p(still NOT a significant effect whatever it means | null)". EDIT. Fixed the last sentence. |
Why argue when you can simulate:
Lo and behold, we reject the null hypothesis that the mean of a normal distribution is equal to zero in 5% of all simulations, even though the null hypothesis is in fact true. (`rnorm` defaults to 0 mean and 1 sd)