I've been lazy about writing a good filter to weed out the nasty ones. I considered quite a few approaches, ranging from an NLTK-based algorithm to a basic brute-force list search with word stemming. Then I realized it would take me a lot more time to develop that than the rest of the project.
So, I chose the easy way out. Turns out it saved me quite a bit of effort, too. :)