|
Some results normalized by overall language popularity. Specifically, the entry in row R and column C is 1000 * (hits for C in language R) / (hits for "the" in language R) or "---" if either the numerator or denominator was small enough not to make the top-10 list on github. All the scraping was done by hand and the numbers rounded to a limited number of places in the process, so there may very well be mistakes. [EDIT: oops, initially I failed to paste in the actual data.] ugly hack ugly beautiful lol wtf buggy
xml --- --- 14.330 72.775 2.489 ---
c 6.348 35.421 0.670 0.768 3.650 32.009
html 1.359 9.001 34.791 4.233 3.095 5.870
rb --- --- --- 4.483 3.151 ---
py 9.051 35.614 28.506 --- 11.135 15.066
php 3.731 8.333 1.781 39.037 1.792 3.754
c++ 2.851 12.507 --- 0.501 102.657 7.579
js 6.850 16.826 2.643 2.635 7.406 29.761
Tentative conclusions: Python is ugly-hack-iest and (almost exactly tied with C) ugliest; HTML is most beautiful with Python a close second, XML is lolliest, C++ is WTFiest, and C is buggiest.Tentative meta-conclusion: these numbers have no value beyond idle amusement. But they idly amused me, so that's OK. (The weirdest result of the lot, to me, is XML coming top for "lol". If you do the search and click on "XML" on the left you'll see why it is. Lots of instances of what I think are the same file, full of "&lol;" entities. LOL, that's pretty ugly. WTF? An ugly hack, I guess.) |