| > you cannot generalize from a non-random sample So, honest question: If any survey of any size can be ignored on the basis that the sample is not random, then how is any survey meaningful? Isn’t this a self defeating argue? You can’t prove the sample is random, all you can do is show differences between samples and suggest its not consistent... but how do we go away and prove that some other survey we’re comparing it to is from a random sample? ie. Isnt this just a convenient excuse to deny that a survey is meaningful? Statistically, how do you mathemtaically quantify the effect of selection bias? ...because, it seems to me, unless you can actually do that, you’re just doing some arm chairmhand waving because you don’t like the results youre seeing. This has come up several times (eg. js survey about react vs angular), and no one has ever given me a meaningful and mathematical response. Its always just.. “it must be sample bias”, regardless of the 90000 people they surveyed. I don’t accept you can survey 90000 developers and cannot offer any generalisation from those results without quanatitively proving there is an overwhelming sample bias, and specifically quantifying the degree of that bias. Am I missing something here? Everyone seems thoughorly convinced that this is perfectly normal. (I’m not proud, I’ll take your down votes, but please answer and explain what I’m missing) |
This was the author’s point. Just because you have 90k SO respondents doesn’t mean you can say anything about developers as a population. You can say lots of stuff about SO users. Or maybe developers who use SO. But just because you have lots of responses doesn’t mean you know what developers or jugglers or farmers or whatever population interests you.
The confusion rests with SO’s statement that their survey should be representative of developers in general (or CS graduates or whatever other than only SO visitors).