by sgoranson 5934 days ago
Very cool, and appropriate that you're basically using PG's spam filtering to identify users on his site :)

I think the next step is to write a more complex filter that does not assume word probabilities are independent of each other, i.e. one that takes unusual phrases like "entirely dissimilar" into account.
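
For what it's worth, here is a rough sketch of one cheap way to get part of the way there: keep the naive Bayes style scoring but add adjacent-word pairs (bigrams) as extra features, so a phrase like "entirely dissimilar" gets its own count. This is only my guess at an approach, not what the original filter does. The names `features`, `train`, `score`, and `best_author` are made up for illustration, and the add-one smoothing is an assumption. It also still treats the features as independent of each other, so it approximates phrase awareness rather than truly modeling word dependence.

```python
import math
import re
from collections import Counter


def features(text):
    """Return unigrams plus adjacent-word bigrams, so phrases become features."""
    words = re.findall(r"[a-z']+", text.lower())
    bigrams = [f"{a} {b}" for a, b in zip(words, words[1:])]
    return words + bigrams


def train(samples):
    """Build per-author feature counts from {author: [comment, ...]}."""
    return {
        author: Counter(f for text in texts for f in features(text))
        for author, texts in samples.items()
    }


def score(model, author, text, vocab_size):
    """Add-one smoothed log-likelihood of text under one author's counts."""
    counts = model[author]
    total = sum(counts.values())
    return sum(
        math.log((counts[f] + 1) / (total + vocab_size))
        for f in features(text)
    )


def best_author(model, text):
    """Pick the author whose counts give the text the highest score."""
    vocab_size = len(set().union(*model.values()))
    return max(model, key=lambda a: score(model, a, text, vocab_size))
```

Usage would be something like `best_author(train(samples), "some new comment")`, where `samples` maps each username to a list of that user's comments. Counting a word both on its own and inside a bigram double-counts evidence, so the scores are only good for ranking authors, not as real probabilities.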