Linguist tries to filter out things like that:
https://github.com/github/linguist/blob/master/lib/linguist/...