by orly01 927 days ago
But the moderator AI does not need to understand the meme. Ideally, it should only care about text that violates the law.

I don't think current LLMs need much improvement to tell actual threats of harm or hate speech apart from any other type of communication. And I think those should be the only kinds of banned speech.

And if Facebook wants to impose additional censorship rules, then it should at least list them clearly, make the moderator AI explain which rules were violated, and offer a way to appeal in case it got it wrong.

Any other type of bot moderation should be unacceptable.

1 comment

I would normally agree with you, but there are cases where what was said and what was meant are disconnected.

Example: Picture of a plate of cookies. Obese person: “I would kill for that right now”.

Comment flagged. Obviously the person was being sarcastic, but if you just take the words at face value, it’s about the most negative sentiment score you could get: to kill something. Moderation bots do a good job of detecting the comment but a pretty poor job of detecting its meaning. At least current moderation models do. Only Meta knows what’s cooking in the oven to tackle it. I’m sure they are working on it with their models.
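
To make the face-value problem concrete, here is a minimal sketch of a keyword flagger. The word list and the `naive_flag` function are made up for illustration; this is not how Meta's or anyone else's moderation actually works, just the simplest thing that would trip on the cookie comment.

```python
# Hypothetical word list; a real system would use a trained model, not this.
VIOLENT_WORDS = {"kill", "murder", "stab"}


def naive_flag(comment: str) -> bool:
    """Flag a comment if any word, taken at face value, is on the list."""
    # Lowercase each word and strip surrounding punctuation before matching.
    words = {w.strip(".,!?\"'“”").lower() for w in comment.split()}
    return bool(words & VIOLENT_WORDS)


# The cookie comment is flagged even though nobody is being threatened.
print(naive_flag("I would kill for that right now"))
```

The check sees the word but has no idea it is part of an idiom, which is exactly the gap between detecting the comment and detecting its meaning.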

I would like a more robust appeal process. Something like: the bot flags, you appeal, an appeal bot runs it through a more thorough model and upholds the flag, you appeal again, and then a human or “more advanced AI” really works out whether it’s a joke, sarcasm, or whether you have a history of violent posts and the flag was justified.
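
Roughly what I have in mind, as a sketch. Every name here (`Decision`, `fast_bot`, `thorough_bot`, `final_review`, `review_with_appeals`) is hypothetical, and the stage logic is a placeholder standing in for real models or a human reviewer. The point is only the escalation shape: each appeal moves the comment to a more careful stage, and the trail records why each stage decided what it did.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Decision:
    stage: str
    upheld: bool   # True means the flag stands
    reason: str    # explanation shown to the user


# A reviewer takes the comment text and the author's count of prior violent posts.
Reviewer = Callable[[str, int], Decision]


def fast_bot(text: str, prior_violent_posts: int) -> Decision:
    # Placeholder for a cheap first-pass model: face-value keyword match.
    hit = "kill" in text.lower()
    return Decision("fast_bot", hit, "violent keyword" if hit else "no match")


def thorough_bot(text: str, prior_violent_posts: int) -> Decision:
    # Placeholder for a larger model; here it still only sees the words.
    hit = "kill" in text.lower()
    return Decision("thorough_bot", hit, "violent keyword" if hit else "no match")


def final_review(text: str, prior_violent_posts: int) -> Decision:
    # Placeholder for a human or more advanced model: weighs idiom and history.
    idiom = "would kill for" in text.lower()
    upheld = prior_violent_posts > 0 or not idiom
    reason = "history of violent posts or literal threat" if upheld else "figure of speech"
    return Decision("final_review", upheld, reason)


def review_with_appeals(text: str, prior_violent_posts: int,
                        stages: List[Reviewer], appeals_filed: int) -> List[Decision]:
    """Run the first stage, then one further stage per appeal the user files."""
    trail = []
    for stage in stages[: appeals_filed + 1]:
        decision = stage(text, prior_violent_posts)
        trail.append(decision)
        if not decision.upheld:
            break  # flag cleared, nothing left to appeal
    return trail


STAGES = [fast_bot, thorough_bot, final_review]
for d in review_with_appeals("I would kill for that right now", 0, STAGES, appeals_filed=2):
    print(d.stage, d.upheld, d.reason)
```

With two appeals and a clean history, the cookie comment is upheld by both bots and cleared at the last stage. With a nonzero `prior_violent_posts`, the last stage would uphold it instead.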