| HN Mirror

> But for comparing stuff between sexes, the "variability hypothesis" about men having greater variability across a range of traits dates back to Darwin, and has a pile of research results. It would seem especially irresponsible to rely on an assumption that the variance was equal between the sexes.

That would imply a much larger advantage for men. If you were seeking to show that women outperform men, you'd gloss over that point as much as you thought you could get away with.

(In a little more detail: if you determine that it takes an athleticism factor of 10.8 to run 200 miles, greater male variability immediately implies that among all people who have that much athleticism, the average male athleticism will be quite a bit higher than the average female athleticism. The average of a thresholded normal distribution is quite close to the threshold, but it gets farther away as the standard deviation increases.)

> Further, given the kind of selection effects you were alluding to before, it may not even be safe to assume these distributions are normal. While the endurance of the broader population ought to be normally distributed, if there are various hurdles on the path to participating (e.g. the organizers say you should probably have completed a 50 mile race at a given pace before signing up for their 100 mile race?) one might well see a different overall shape.

...you don't seem to have understood the method. The assumption is that the broader population is normally distributed - you know, what you already said it was safe to assume - and that the selected population consists of that part of the broader population's normal distribution that exceeds some threshold.

Or in pictures, we assume that the population looks like this:

           --
          ----
          ----
         ------
       ----------
   -----------------+

and the running-200-miles population looks like this: