by dontupvoteme
936 days ago
How do you account for colloquial (non-English) language that could be naively misconstrued as toxic? For example, "geil" in German means either "cool" or "horny" depending on usage. It's not fundamentally different from, e.g., "wicked" in English, but the biggest bias that potentially all of these ML models exhibit is a predisposition towards Anglocentrism.

You can find details on how the multi-language toxicity lists were created in section 7.3 of the NLLB paper: https://arxiv.org/pdf/2207.04672.pdf. TL;DR: it's not just a translation of a base English list. Even though we started from that, each language has a curated list that was built by professional translators.