|
|
|
|
|
by stdbrouw
2984 days ago
|
|
> If a null hypothesis is invariably true, it's impossible to reject it. Which means the scientists will not be able to find any statistic or data to support any of their bad, original hypotheses. Not 5%, not 0.005%, nor whatever. Why argue when you can simulate: > n <- 50
> simulations <- 10000
> sd <- 1
> se <- sd/sqrt(n)
> crit <- 1.96 * se
> mean(abs(colMeans(sapply(rep(n, simulations), rnorm))) > crit)
[1] 0.0494
Lo and behold, we reject the null hypothesis that the mean of a normal distribution is equal to zero in 5% of all simulations, even though the null hypothesis is in fact true. (`rnorm` defaults to 0 mean and 1 sd) |
|
May I ask you why you chose to use the normal distribution in your example or any distribution at all, for that matter? What I was replying to was
">they only test null hypothesis that are true."
Which means that the null hypothesis is always true no matter what data you collect trying to reject it. It does not depend on the null distribution (normal in your example), the value of the test statistic (the mean of the sample in your example), or the threshold (crit in your example). In fact, the null distribution in this case is not a distribution at all since there's no randomness in the null hypothesis. We know for a fact that it is always true (in the hypothetical situation we are considering).
It's more like
or, if you insist on using the normal distribution, In fact, in your example, since you are essentially running 1000 hypothesis tests on different samples, multiple hypothesis correction would solve the "problem" with p-value. This is how I would do it. BOOTSTRAP CONFIDENCE INTERVAL CALCULATIONS Based on 1000 bootstrap replicatesCALL : boot.ci(boot.out = boot.out, type = "basic")
Intervals : Level Basic 95% ( 0.9774, 0.9780 ) Calculations and Intervals on Original Scale
P.S. p-values are great when used appropriately.