Hacker News new | ask | show | jobs
by rwilson4 1355 days ago
Gelman is one of the few self-proclaimed Bayesians who doesn't seem to outright hate frequentist approaches. They're complementary approaches. Bayesian methods are great for combining different sources of information. Frequentist methods are great for validating that a method is working well. (For example, Gelman often recommends running simulations to see if models give sensible predictions, but that is itself a pretty frequentist thing to do.)

Frequentism is mostly about how to evaluate a methodology. It's pretty agnostic about what that methodology is. Bayesian methods are about combining different sources of information. In a situation where you only have one source of information, Bayesian and Frequentist methods usually give the same answer.

People say you might as well always use Bayesian methods then. But no matter what, you should always try to validate or poke holes in your model, and Frequentist techniques are great for that. So it's best to be familiar with both!

3 comments

> running simulations to see if models give sensible predictions, but that is itself a pretty frequentist thing to do

Is looking at probability distributions “a pretty frequentist thing to do”? Even when those models and simulations include _prior_ probability distributions? Sure, one can (re)define frequentist to include Bayesian models - as Gelman seems to want to do in that post. I just don’t see how this helps to clarify anything.

>In a situation where you only have one source of information, Bayesian and Frequentist methods usually give the same answer.

Bayesian and frequentist methods always give the same answer because they represent two different ways of translating the same mathematical ideas into English.

What do you call "frequentist methods"?

Imagine the following question: "what's the male/female ratio in gorillas?"

A frequentist method may provide the answer "[1.1 1.3] is a 95% confidence interval" based on taking a sample of zoos and asking them about the sex of the last gorilla born there.

A Bayesian method will provide a different answer - maybe one difficult to reconcile. Because it's not "translating the same mathematical ideas into English". Not only the translation is different - the "mathematical ideas" considered are different as well.

I don't think that's a complete case, you can't just say that the bayesian answer would be difficult to reconcile without offering it.
A Bayesian may put a strong prior on around the 1:1 sex ratio at birth - because in addition to that data regarding a sample of births they incorporate in the calculation knowledge about the plausible ratio coming from previous observations or biological facts about giraffes or related animals - and get a 95% credible interval (which is conceptually completely different from a 95% confidence interval) like [0.99 1.01] or whatever.

You can't just say that Bayesian and frequentist methods _always_ give the same answer without offering even a _single_ example.

What is commonly understood as 'Bayesian methods' will give answers in the form of a probability distribution. What is commonly understood as 'frequentist methods' will never do that. How can they always give the same answer then?

Thanks for adding detail. I didn't offer any examples because I want to use ones that sound representative to you.

The 95% confidence interval is in reference to a probability distribution, I'm not sure what you mean when you say that frequentist answers aren't in terms of those.

As for your bayesian answer, there is a prior that would make their result equal to the frequentist one - and in another example where priors were more obviously crucial (weak evidence) the frequentist would still use Baye's theorem. Their ensemble would be all possible worlds in which they'd ask the question.

> The 95% confidence interval is in reference to a probability distribution, I'm not sure what you mean when you say that frequentist answers aren't in terms of those.

Maybe I should have been more explicit. Let me complete what I wrote above:

"What is commonly understood as 'Bayesian methods' will give answers in the form of a probability distribution FOR THE QUANTITY OF INTEREST. What is commonly understood as 'frequentist methods' will never do that."

In the sex ratio example, the 95% confidence interval [1.1 1.3] DOES NOT mean that the probability that the sex ratio is between 1.1 and 1.3 is 95%. What it means is that when you calculate confidence intervals using this method 95% of them will contain the true value of the parameter. It's not the same thing.

> As for your bayesian answer, there is a prior that would make their result equal to the frequentist one

So it doesn't _always_ give the same answer, does it? (In the best case you get the same answer by twisting what a confidence interval means and only if the "magic" prior is used.)

If you mean that frequentist methods shouldn't be used if they give a different answer I agree!

The problems with frequentist methods have been known for decades: https://www.jstor.org/stable/2281869

They are used because the are easy even though they give the wrong answer (or the answer to the wrong question) - not because they always give the right answer.