Hacker News new | ask | show | jobs
by turtledove 1477 days ago
Found a slur in the dataset, not sure if these are programmatically generated (I assume so) but may want to consider filtering those out.
2 comments

It's worth pointing out that they are slurs in the U.S., not globally. One is a type of meatball where I'm from.
Yep. But if your user base is likely to include Americans, then you might want to consider filtering them out. (As it could be shocking to read.)

Note the word consider, by the way. I'm not demanding anything here, I'm saying the author should make a deliberate choice about their inclusion rather than including them just because they were auto generated.

For the curious, I think they mean decimal values 16408693 and 16410119. Which have other archaic meanings but their primary modern meanings are a slur. 16410349 is apparently British slang for exhausted, so maybe gets a pass?