Hacker News new | ask | show | jobs
by PurpleBoxDragon 2996 days ago
>You cannot argue that all studies of complex topics are invalid simply because their topics are complex.

If you take a random sample of studies you can make a statistical analysis. You don't need to examine every cow to make an argument that there are no pink cows, but you do need to do a random sample. And that's if you only want to meet the highest standards of evidence. Much lower standards can be far easier to meet.

1 comments

> If you take a random sample of studies you can make a statistical analysis. You don't need to examine every cow to make an argument that there are no pink cows, but you do need to do a random sample. And that's if you only want to meet the highest standards of evidence. Much lower standards can be far easier to meet.

Your comparison of this problem with pink cows shows that you haven't given it two seconds thought. Estimating the number of pink cows in the world is a very simple problem. Determining pay gap is a very very complex problem that starts with defining what the question really is and associated fights between different interest groups which might prefer one or another definition, then goes on to the (social, privacy, and rights) problem of obtaining the data, and moving on into with the data analysis itself which is just hellish if you want to have any semblance of rigour, and finally policy take aways from the analysis which hinges crucially on how you defined the question initially.

Stop adding to the noise please.

>Estimating the number of pink cows in the world is a very simple problem.

If by 'estimating' you mean a scientific study that tries to answer the question, then it isn't simple at all. First we need a rigorous definition of pink cows. If I dye my cow pink, does that count? What if other people don't agree with my definition? A pig whose skin is pink is considered pink, so should I only rely on hair color? And what counts as pink? Are we only going with stereotypical hot pink? There is a red cow, but it is a really brownish red. Would a brownish pink be enough to qualify as a pink cow?

So once we solved all those problems, we need to come up with a methodology, and it likely won't be the same everywhere. We could make the problem a lot simpler by reducing our search space to say, only cows on ranches in the state of Montana. But to do a global sampling isn't easy.

>associated fights between different interest groups which might prefer one or another definition

To my knowledge (and with no peer reviewed research to back up my view), there is no groups who have a political stake in what counts as a pink cow. So for that reason it is simpler because there aren't political complications.

But you seem to be confusing something. You appear to be talking about studying wage gap. I was talking about studying studies of wage gaps.

So for my plan, it would work like this:

Taking all the studies of wage gap in the last n years, pick x at random. For each of these, determine if each one does or does not account for some factor that impacts pay regardless of gender (say height of employee). You can then compute what percentage of studies took this factor into account.

Then you repeat this with a few other factors, each time repicking the studies investigated. From those percentages, you can determine how often your selection of factors are taken into account, and from that you might be able to make the argument that the data is biased enough to not be usable.