Hacker News new | ask | show | jobs
by CM30 841 days ago
The problem is also that standards for what's 'acceptable' discourse vary wildly between countries and regions and subcultures and other groups. This might theoretically be ignorable by saying "well, everything will be run based on what's acceptable among tech bros in Silicon Valley", but that might in turn annoy many governments in Europe and Australia and South America and Asia for either being too lax on the restrictions/moderation or too strict on it.

For example, there are certain words which are highly offensive in some regions, but completely fine in others. Some pictures and imagery which are banned or seen as offensive in some places but not others, and other such differences.

There's also often a lot of subculture specific terminology that might come across as offensive to those outside said culture if they don't know the context or situation too. We've seen that with certain tech related terms in the past.

Political misinformation and outrage bait is similarly not always simple to classify. Is content from a satire site that too many people take as real 'misleading'? I know there have been problems with things like The Onion, the Babylon Bee, Hard Drive, etc where people took their stories at face value. It also often falls into 'things the government/those in power don't want others questioning' too, especially when it comes to certain political events and controversies.

Let's not even get into talking about conflicts of interest and scenarios where business value and fairness clash, like when certain political figures say things that would probably get any other account suspended. What if the US president, UK prime minister, president of the EU or some country leader promotes information that's abusive, misleading or dangerous? Twitter had to deal with that problem back in 2016, and just ended up pretending he didn't exist until his term in office was over.

And then there's just context. Is an insulting message aimed at someone an attack by a rival or bully or troll? Or banter from one of their friends, like how many friend groups and families can make friendly jokes at each other's expense? On smaller communities this isn't much of a problem since the people there know each well enough to tell the difference between personal attacks and jokes, but is that going to be the case with a moderator here?

So moderation gets kinda tricky due to all the context needed to know whether a message is abusive or not. A well moderated large platform probably needs people in a variety of locations, from a variety of backgrounds, with some sort of way of getting a group consensus if any staff member is unsure.

Of course the other incentives you point out don't help much either. Google and co want to automate everything, so the idea of using humans to tell the difference between quality and spam/abuse is never even considered by them. Some things like outrage bait are definitely supported by the platforms for their addictiveness, and then we get situations where the owners themselves are horribly biased/trolls/whatever and are happy to allow abuse so long as its from 'their team'.