Hacker News new | ask | show | jobs
by w457uiw4gftyi 2447 days ago
> What irks me the most, is the lack of server-side language support for cloud offerings, on-premise Enterprise products, etc. If Microsoft can support Klingon for translation purposes, then just maybe they could support Inuktitut a little better.

I've noticed that in terms of language support in general. Of course, languages with Western and/or wealthy populations seem to have better support. In Elasticsearch, I can set up a field for Irish or Latvian or Basque analysis out of the box (each with ~1 million speakers or fewer). But Zulu or Uzbek or Malay (each with 10 million+ speakers), and sorry, no support.

1 comments

Part of this may come down to corpus availability. The EU requires that much of its official document production be translated in the official language of every member country (which would include Irish and Latvian). With essentially bilingual translation pairs, it's much easier to feed the data into machine translation; and the need to feed this translation machine for official translation also means it's relatively easy to contract out translation work if you need to.