The Instagram API returns an APINotAllowedError (response code 400) when you search for one of these hashtags. I'm using it to automatically identify and categorize NSFW hashtags for one of my projects, HashPix. You can see it in action at http://hashpix.herokuapp.com/
This list contains eg. both #sex and #thesexpistols. It seems more likely that anything containing the substring "sex" is banned. The list, then, is just examples, rather than some exhaustive, official blacklist.
In a country with 35.7% obesity (e.g. 203 pounds at 5'9" = obese), you'd think it would be a good thing to let people inspire others to get back to normal weight.
I've been lazy about writing a good filter to weed out the nasty ones. I considered quite a few approaches - ranging from an NLTK-based algo to basic brute-force list-search with word-stemming. Then I realized, it would take me a lot more time to develop that, than the rest of the project.
So, I chose the easy way out. Turn out it saved me quite a bit of effort, too. :)
I can confirm this. The search term alluded to, as well as all hashtags stemming from it, have been auto-tagged as nsfw in HashPix as well.
OT: We have just managed to carry out a conversation about a word that was not even mentioned once within the conversation itself. And yet, each one of us knows exactly which word we are alluding to. Voldemort, anyone? :)
The Instagram API returns an APINotAllowedError (response code 400) when you search for one of these hashtags. I'm using it to automatically identify and categorize NSFW hashtags for one of my projects, HashPix. You can see it in action at http://hashpix.herokuapp.com/