Hacker News
by bjornsing 1134 days ago
I must admit: this went against my intuition. My first guess was that you would end up with an urn full of either red or blue balls.
4 comments

Bear in mind the default behaviour if there were just two balls and you never added any more: the proportion of red picks vs blue picks would tend to 1/2. So there's naturally a tendency for the proportion to concentrate in the middle.

As you say, the way in which new balls are added tends to push the proportion towards the extremes.

The uniform distribution is the result of these two tendencies exactly cancelling out.
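
For anyone who wants to check the uniform claim, here is a minimal sketch that computes the exact distribution instead of simulating it. It assumes the setup discussed in this thread (start with one red and one blue ball; each picked ball goes back along with one more of the same colour). The function name `red_count_distribution` is made up for illustration.

```python
from fractions import Fraction


def red_count_distribution(draws):
    """Exact distribution of the number of red balls after `draws` picks.

    Assumption: the urn starts with 1 red and 1 blue ball, and every picked
    ball is returned together with one extra ball of the same colour.
    """
    dist = {1: Fraction(1)}  # maps red count -> probability
    total = 2                # balls currently in the urn
    for _ in range(draws):
        nxt = {}
        for red, p in dist.items():
            p_red = Fraction(red, total)  # chance of picking red now
            # Picked red: one more red ball.
            nxt[red + 1] = nxt.get(red + 1, 0) + p * p_red
            # Picked blue: red count unchanged.
            nxt[red] = nxt.get(red, 0) + p * (1 - p_red)
        dist = nxt
        total += 1
    return dist


if __name__ == "__main__":
    for red, p in sorted(red_count_distribution(10).items()):
        print(red, p)
```

With `draws = 10` the urn holds 12 balls, and every red count from 1 to 11 comes out with probability 1/11, which is the finite version of the uniform limit.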

This is not correct as stated. It does not “tend towards extremes” as you might expect intuitively. That would be the case if half the time it approached 100% blue and half the time it approached 100% red, which is precisely not what happens.
For me as well. And when my stochastic probability professor posed this question to the class by a show of hands, it was nearly unanimous in favor of the 0/100% end behavior.
Yeah, I don't know what my intuition was.

But the problem is symmetric, and even if you pick a red, you end up with two reds and one blue, so not so much imbalanced.

And even if the mix becomes really imbalanced, say 7 red and one blue, picking the rarest color will have more effect than picking the most common one. So you could consider that the system tries to balance itself naturally, hence avoiding huge swings in some direction or the other.
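
To put numbers on the 7 red / 1 blue example, here is a small sketch with exact fractions. The helper name `next_proportions` is hypothetical, and it assumes the rule that the picked ball goes back with one more of the same colour.

```python
from fractions import Fraction


def next_proportions(red, blue):
    """Red proportion now, after a red pick, after a blue pick, and in expectation.

    Assumption: the picked ball is returned with one extra ball of its colour.
    """
    total = red + blue
    current = Fraction(red, total)
    after_red = Fraction(red + 1, total + 1)   # a red ball was added
    after_blue = Fraction(red, total + 1)      # a blue ball was added
    # Weight each outcome by the chance of that pick.
    expected = current * after_red + (1 - current) * after_blue
    return current, after_red, after_blue, expected


if __name__ == "__main__":
    current, after_red, after_blue, expected = next_proportions(7, 1)
    print("now:", current)
    print("after red pick:", after_red, "shift:", after_red - current)
    print("after blue pick:", after_blue, "shift:", after_blue - current)
    print("expected:", expected)
```

Picking red moves the proportion from 7/8 to 8/9 (a shift of 1/72), while picking blue moves it to 7/9 (a shift of 7/72). Weighted by how likely each pick is, the expected proportion stays at exactly 7/8, so the rare pick's bigger effect offsets the common pick's higher probability but does not actively pull the urn back towards 1/2.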

Me too, but only because I was expecting something interesting to happen since it was on HN!
The uniform distribution result is not interesting enough for you?
Oh, I see! Usually "this sort of thing" would have a normal distribution I guess? That IS quite interesting.
Usually 'this sort of thing' would go to the extremes: colours that are already prevalent have a bigger chance of getting more added to them.

It's interesting that they don't.

A normal distribution would be extremely weird and unexpected (and not even really possible): we know for sure that the proportion in the end has to be between 0% and 100%. Normal distributions don't have such cutoffs.

On a log-odds (logit) scale, though, we get something that resembles a normal (a logistic distribution).
Yes, a uniform distribution over probabilities in (0,1) would give you something bell-curve shaped on the log-odds scale.

The log-odds scale also makes it clear why probabilities 0 and 1 are very special: they only show up as limits in the log-odds scale.
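
A quick sketch of why a uniform proportion turns into a logistic distribution on the log-odds scale, and why 0 and 1 only appear as limits. The function names `logit` and `logistic_cdf` are just illustrative helpers written here, not taken from any library.

```python
import math


def logit(p):
    """Log-odds of a probability p strictly between 0 and 1."""
    return math.log(p / (1 - p))


def logistic_cdf(x):
    """CDF of the standard logistic distribution."""
    return 1 / (1 + math.exp(-x))


# If U is uniform on (0, 1), then P(logit(U) <= x) = P(U <= logistic_cdf(x))
# = logistic_cdf(x), so logit(U) follows the standard logistic distribution.
# The loop checks that logistic_cdf undoes logit, and shows the log-odds
# growing without bound as p approaches 0 or 1.
if __name__ == "__main__":
    for p in (1e-9, 0.01, 0.1, 0.5, 0.9, 0.99, 1 - 1e-9):
        x = logit(p)
        print(f"p={p!r:<22} log-odds={x:+.3f} logistic_cdf={logistic_cdf(x):.6f}")
```

The logistic density is symmetric and bell-shaped, with heavier tails than a normal, which matches the "resembles a normal" remark above.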