Hacker News new | ask | show | jobs
by shawnps 3978 days ago
Interestingly, a search for "retard" in code on GitHub yields over 200,000 results:

https://github.com/search?q=retard&type=Code&utf8=%E2%9C%93

Not sure what made this particular case different.

3 comments

Most of those are badwordlists.
On the first couple pages, yeah. Skip 10+ pages and you get stuff like "nigger stole my bike" etc.

As a side question, shouldn't it be fairly easy to flag those repositories and then sort through those who have it as an offensive word list and those who are just being derogatory?

Probably that someone complained
I don't understand why they don't filter it on upload stage ("push"). "git push" > "sorry, can't push because of words: [list of words here]". Is it more difficult than later disable whole repo because of one stupid joke?
Because you may need to put those words in a repo in a non-offensive context. For example, a list of "bad" words to look for.
I'm against censorship at all, but when humans are involved in process, it becomes much worse.
You could work with hashed variants in that case; there might be performance benefits there, too.