|
|
|
|
|
by vlovich123
1234 days ago
|
|
Hmmmm… I have code where I’m randomly sampling an exponential function and even thousands of samples are insufficient to pass chi-squared tests at 95% accuracy that the observed distribution matches my expected ground truth exponential function. The reason? Chi-squared needs 5 samples at the tail which has an effective probability of 0. And if I try to flip it and say “run the experiment with 500 samples 100 times but verify the observed matches the expected with a 5% error”, I’ll still see more than 5 runs that fail this. Is there something special about exponential functions or is it just my misunderstanding of statistics/calculus at play here for doing this correctly? I assume it’s the latter but I haven’t figured out what I’m doing wrong. |
|
In any case, it sounds like maybe it falls under the "if you're interested in things that are rare" paragraph in my post above. You can always design statistics that are arbitrarily hard to estimate. The things that we're typically interested in estimating in real life, though--averages, proportions, and similar--are typically estimable with reasonable sample sizes.