Hacker News new | ask | show | jobs
by notaurus 977 days ago
> The next time somebody claims that the human brain operates according to bayesian logic, simply ask them what prior probability they (currently, until the next update) would assign to that claim.

A = a person is able to (correctly or not) quantify their priors

B = brain operates according to Bayesian logic

P( A ) ≈ P( A | B )

1 comments

So you're saying that P(A) is almost independent of B, as in

P( A ) ≈ P( A | B ) ≈ P( A | ¬B )

?

I suppose my argument is that if someone is using priors that are either a hard 1 or a hard 0, they've removed themselves from the ability to use Bayesian logic at all in any situation where data points in the opposite direction of their priors.

While you can still call it "Bayesian" if you insert a prior of 0, I think such an argument is a direct contradiction of the purpose of using Bayesian logic.

In other words, I would argue that P( ¬A | B ) ≈ 0. Refusual to admit a prior greater than a hard 0 is not compatible with Bayesian logic. Prior probabilities of exactly 0 should be seens as outside of the valid domain within Bayesian logic under most circustances.

Maybe we are talking at different levels.

The people theorizing that the brain is Bayesian are not saying that humans do it consciously. Like they can examine priors and making decisions.

It is just that the neurons in the brain update in a way that can be modeled roughly as Bayesian. It happens without us 'deciding to do it', it is just how the brain updates to process the environment. It is happening continually, as we take in senses, and update our internal model.

Sure. I understand that part. 99% of humans wouldn't even be able to do it consciously if they tried, so this has to operate on the subconscious level, to the extent that it's an accurate model.

But if you start out with randomized priors, you will never reach 1 or 0 regardless of how much data you expose a bayesian system to. Humans, on the other hand, tend to fall into treating a probability as either 0 or 1 quite rapidly. For this step in particular, there seems to be something like rounding or L1 regularization going on.

Then, once they're stuck in 0 or 1 priors, people often revert to using evaluation similar to Bayes' Theorem again, but in that case the priors can no longer be updated (except through something like a psychological shock, hallucinogen etc).

But, as stated above, you cannot really reach such priors using Bayes' Theorem alone (if we assume the priors are not provided by genes or something that happens before the learning).

Along these lines, what is happening when someone walks into a room and sees a snake, but then does a double take and sees that it is a rope.

It seems humans do have miss-identifying visions. Where they categorize something fast but incorrectly, and then might need 'help/time/shock', to kick start re-categorizing. To re-see it again.

Or Like when seeing something for the very first time and they are 'befuddled', can't grasp it. Maybe like Bayes is having to iterate on it much longer.

I'm curious since I'm not all that familiar with Bayes. Is this what you are talking about with 1 or 0. People do make very rapid judgments, then settle on an a 'view'. Then it can take something to make them re-adjust.

> Is this what you are talking about with 1 or 0.

It mostly has to do with the zeroes. Take Bayes' theorem:

  P(A|B)=P(B|A)P(A)/P(B)
The prior for some hypothesis A is P(A)

If you start with no knowledge, P(A) can be something like 1/n, where n is approximately the number of competing hypothesis.

B is the "data" part, a weighed compbination of all data used to update P(A).

P(A|B) is the posterior probability for A, given the data B. This posterior is then used to update your prior P(A) before you are exposed to new data.

Take the hypothesis "God exists as a male person". Before any data, maybe you give this a 1/4 prior probability. Add the Bible as your only source of data, P(A|B(ible)) may go to 9/10. Add all other religious texts, and maybe it goes to 1/2. Add all of Science, and maybe it goes a bit lower.

Now what happens when we set a probability to 0?

If P(A)=0, then P(A|B) stays at 0 regardless of how much data you pile into B. A prior of 0 makes it completely impossible to convince you that A is true.

Or, lets say you have two competing hypotheses:

  A1: God is exists as a male person:

  A2: Hypothesis A1 is false. (God either does not exist or is not male or not a person).
Since A1 and A2 cover all possible states, P(A1)+P(A2) = 1.

In other words, if you set P(A1) to 1 you simultaneously set P(A2) to 0.

And as above, when you have a prior of 0, it will never get updated using bayesian logic, so regardless of any evidence to the contrary, P(A1) will remain 1.

Only in a situation where P(B|A1) is also 0 could you ever doubt A1. When this happens, a person may live through severe cognitive dissonance or crisis, as if all of reality falls apart. The person will have to choose to discard A1, or to find some reason to ignore the data B.

For instance, if you believe 100% that some organization (any organization, but let's pick Hamas this time) are the good guys (hypothesis A), and some event B (like a massacre of civilians) seems to strongly oppose that belief.

Lets say P(B|A) is very low in this case.

If you have built your existence and identity around P(A)=1.

Now, instead of believing B really happened, you can introduce an alternative (conspiracy theory) view on the data, for instance:

  C: It was really Mossad that tricked Hamas into attacking Israel.
In this case, P(C|A) can be quite a bit higher than P(B|A).

Now, if you extend this, you can even turn this around. If you see A as the DATA not as the Hypothesis, you can set up

  P(B|A) = P(A|B)P(B)/P(A)

  P(C|A) = P(A|C)P(C)/P(A)
While other people take B simply as data, you have now turned it into something to be disproved. And if you are certain that P(A)=1, you can believe almost any conspiration theory C as long as P(A|C) > P(A|B).

And this can cascade. If P(A|B) is small enough, then you can end up being certain that P(B) = 0 and eventually that P(C)=1.

Anyway, the above type of reasoning is at the root of religious and dogmatic thinking. In fact even a single false belief held with a prior probability of 1 can be used (through Bayesian logic) to prove almost anything.

And this is not limited to happening by accident. A skilled demagogue who can trick the audience into accepting (with no room for doubt) a single false or inaccurate claim, can then use to manipulate them into believing almost anything.

(Obviously, in the Hamas/Isreal conflict, this goes both ways.)

Maybe it is just an engineering problem.

Humans aren't strictly math computations that can arrive at a absolute 0 or 1.

Maybe humans have a bit of a random number generator that kicks in some noise to avoid reaching absolute 0 or 1.

Allows some re-evaluation.

Maybe that would be option in AI to avoid the problem. During one of the games with AlphaGo it seemed to get stuck and was treading water making really 'plain or lackluster' moves. Until it kind of got unstuck. It seems to be similar?