Hacker News
by andrew3726 3219 days ago
> Specific details of our network architecture will not be published at this time. DeepL Translator is based on a single, non-ensemble model.

Kinda sad to hear, but completely understandable. I'm curious whether the difference in performance is due to their model specifics or just better training data.

Does anyone have more information?

1 comment

They have ideal training data, since this is a Linguee venture (https://www.linguee.com/). Linguee has millions of paragraphs translated from one language to another.

I have no information on the model, unfortunately.