Y
Hacker News
new
|
ask
|
show
|
jobs
by
crawdog
2138 days ago
You could build a bloom filter with the profane words, then check your tokens for the page against that filter.
2 comments
teraflop
2138 days ago
That would make sense if your list of expletives was too large to fit in memory, which would be... impressive, to say the least.
link
GordonS
2138 days ago
I'm Scottish, and I'll say it's definitely possible :p
link
Keyframe
2138 days ago
if profanities were single words only. What if they weren’t? You’d have to have a giant list of permutations and build a huge-ass bloom filter. Still doable though, but then spelling errors (or not)...
link