|
|
|
|
|
by goldfeld
666 days ago
|
|
Then congrats all the more on this work, I had assumed a motivation from attachment also. As for the tech stacks and langs yeah they're fun, but I meant in how abstract Inuktitut looks, I guess the human language in the loop is the real meat of the problem and of the "working with". |
|
I made a few syllabics converters for Ojibwe when I was younger, but Ojibwe and Cree are harder to build converters from than Inuktitut because they have a lenis-fortis distinction in the latin script but not syllabics, so the word "anishinaabe" can be converted unambiguously to ᐊᓂᔑᓈᐯᒃ, but trying to convert the other way could render any one of anishinaabe, ani*zh*inaabe, anishinaa*pe*, or ani*zh*inaa*pe*. I don't think it could be done accurately without some sort of machine learning. There's also a lot more complexities surrounding how different dialects are encoded. In the end, Inuktitut was simpler to work with and has a lot more everyday speakers and text to work with, especially because the Canadian territory of Nunavut publishes its government documents and websites in Inuktitut too.
There's a lot to cover but I think the technical challenges might be interesting to the Hacker News crowd, I may make a series of videos about it someday!