|
|
|
|
|
by rhesa
1477 days ago
|
|
My litmus test for translation services is to see if the ambiguity between "turkey" (the bird) and "Turkey" (the country) manifests itself. Many European languages have two separate words for the two concepts that can't possibly be confused, so I think this is a good unit test. My Dutch test sentence is: "de kalkoen bezoekt Turkije" (the bird visits the country). Most of them get it wrong for most European languages: google -> german: "die türkei besucht die türkei" bing -> swedish: "kalkonen besöker kalkon" deepl -> hungarian: "a pulyka meglátogatja a pulykát" translate.com -> french: "la dinde visite la dinde" lingvanex.com -> spanish: "el pavo visita pavo" Only one gets it right: systran.net -> french: "La dinde visite la Turquie" |
|
I tried out "Hey, how are you? I don't understand how it can be so warm today." in my native language, and systran is the only one that got it 100% correct. Google was close, but reversed an article and a noun. The others mixed up "don't understand" with "don't know", which are similar but different enough to sound unnatural.
I've always thought that the best way to assess these systems is how they handle colloquial speech, stuff that we often take for granted but that's really quite strange to translate "literally". I bet even that phrase -- "take for granted" -- would be difficult to translate even though I'm certain most languages have a phrase for that exact sentiment.