Hacker News
by Galanwe 484 days ago
> 2. All those "bad kebabs" are actually located within 500m of the nearest station. No kebab located further than 500m away is bad.

Right, but this is selection bias. There will always exist a distance D within which all the bad kebabs are located.

Unless D was demonstrably chosen _before_ looking at the data, this has no meaning.

One also has to take the kebab density into account.
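
To make the selection-bias point concrete, here is a rough simulation sketch. Everything in it is an assumption for illustration, not data from the article: the function name, 200 kebabs, a 10% bad rate, and distances drawn uniformly up to 2000m with quality independent of distance.

```python
import random


def all_bad_within_rate(n_kebabs=200, bad_rate=0.1, max_dist=2000.0,
                        fixed_d=500.0, trials=1000, seed=0):
    """Simulate a world where kebab quality is unrelated to station distance.

    Returns two fractions over the simulated cities:
    (share where all bad kebabs lie within a D picked after seeing the data,
     share where all bad kebabs lie within the pre-chosen fixed_d).
    """
    rng = random.Random(seed)
    post_hoc = fixed = counted = 0
    for _ in range(trials):
        # Only the distances of the bad kebabs matter for this claim.
        bad_dists = [rng.uniform(0.0, max_dist)
                     for _ in range(n_kebabs) if rng.random() < bad_rate]
        if not bad_dists:
            continue  # no bad kebabs at all, nothing to claim
        counted += 1
        # D chosen after looking: the farthest bad kebab defines it.
        d_after = max(bad_dists)
        post_hoc += all(d <= d_after for d in bad_dists)
        # D chosen before looking: a fixed threshold.
        fixed += all(d <= fixed_d for d in bad_dists)
    if counted == 0:
        raise ValueError("no simulated city contained a bad kebab")
    return post_hoc / counted, fixed / counted


if __name__ == "__main__":
    after, before = all_bad_within_rate()
    print(f"D picked after looking: {after:.3f}")
    print(f"D fixed at 500m first:  {before:.3f}")
```

The post-hoc D "works" in every simulated city by construction, even though distance has no effect here, while a D fixed in advance almost never does under these made-up numbers.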

1 comment

This "you have to choose D ahead of time" nonsense is why people distrust and dislike statisticians! Humans have priors on what counts as "close" that are independent of this particular article. If they had said "See, everything within 5000m" or "everything within 5m" you might have a point, but 500m as a rough definition of "close to a train station" is pretty reasonable.

> If they had said "See, everything within 5000m" or "everything within 5m" you might have a point

On the contrary, if everything was within five meters, that would make the finding much more impressive.
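
A back-of-the-envelope way to see this, and where kebab density comes in: if quality were unrelated to distance, the chance that all the bad kebabs happen to fall among the ones within D is a simple counting ratio. The sketch below assumes made-up counts (100 kebabs, 10 of them bad, and 10 / 60 / 100 kebabs within 5m / 500m / 5000m); none of these numbers come from the article, and the function name is hypothetical.

```python
from math import comb


def p_all_bad_within(n_total, n_within, n_bad):
    """Chance that all n_bad bad kebabs are among the n_within closest ones,
    assuming the bad ones are a uniformly random subset of all n_total."""
    if not 0 <= n_within <= n_total or not 0 <= n_bad <= n_total:
        raise ValueError("counts must satisfy 0 <= n_within, n_bad <= n_total")
    # Ways to pick the bad kebabs inside D, over ways to pick them anywhere.
    return comb(n_within, n_bad) / comb(n_total, n_bad)


if __name__ == "__main__":
    # Hypothetical counts of kebabs within each distance (assumed, not measured).
    for label, n_within in [("5m", 10), ("500m", 60), ("5000m", 100)]:
        p = p_all_bad_within(n_total=100, n_within=n_within, n_bad=10)
        print(f"all bad within {label}: chance under 'no effect' = {p:.3g}")
```

The fewer kebabs there are within D, the less likely the pattern is to show up by chance, so a tight D is the stronger finding and a D that covers nearly every kebab shows nothing.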