"To make sure that the popularity of one language over another didn’t skew the results, Vos grabbed an equal number of commit messages per language."
I don't think it really matters. It may be that PHP developers that swear a lot don't use Github, or that Ruby developers that don't swear use Bitbucket, etc..