Hacker News new | ask | show | jobs
by derefr 2179 days ago
> blows GT right out of the water with its accuracy.

For general vocabulary, very likely. I believe Google Translate's competitive edge comes in the form of figuring out translations for jargon, slang, etc. from equivalent-corpus context.

I like to think of Google Translate as a keyword extractor on steroids. It doesn't necessarily give you the right prose, but it does better than anything I know of at giving you the right bag-of-words for indexing a foreign-language document in English.

(I hypothesize this to be Google Translate's real driving purpose, and the reason it still sees regular updates after all these years: it's used to index foreign-language web pages, books, and videos so that there can be a single TF-IDF token in Google's backend for each language-neutral conceptual category rather than distinct token for each language-specific word.)

1 comments

Don't know about slang, but I recently translated some technical documentation from German to English. I was surprised how many specific terms deepl knew, and even if it failed at that the grammar surrounding the offending term was still mostly correct. It was night and day compared to Google. Bag of words describes it well.