|
|
|
|
|
by Barrin92
2242 days ago
|
|
> This is why, after decades of controversy, Google Translates still renders the gender-neutral “they are doctors” in German as “sie sind Ärtze” (masculine) and “they are nurses” as “sie sind Krankenschwestern” (feminine). Google Translate was not programmed to be sexist. The corpus of texts it received happened to contain more instances of male doctors and female nurses. [...] BriefCam’s spokesperson added that they used “training datasets consisting of multi-gender, multi-age and multi-race samples without minority bias,” but declined to provide any evidence or details. Okay, will we at some point admit that we want machine learning algorithms to be able to interface with symbolic rules rather than pretending that everything is a data issue as if we're living in the era of 1930s inductionism? It's clear that the misgendering issue here is not a stochastic one that ought to be solved by 'balancing out' data, it's that we want to impose strict linguistic rules and constraints on a system in a clear manner. There really needs to be more work done in AI that makes it possible to interface with the models we built rather than trying to reframe everything as a data problem and then shove it in some end-to-end black box and then hope that whatever comes out at the other end is correct. The automated systems used in the article are supposed to make judgements about "Detection of body movements that constitute assault." This requires genuine understanding and high-level capacity to reason rather than just pixel-based inference from some camera. |
|
A sentence like 'doctor said no' gets translated into 'daktaras pasakė ne'. Even a human could not translate any better unless wider context is known,which could only be derived from other sentences in the paragraph.