| Thank you for the thoughtful response. I completely agree on all points. Yeah, my first idea was automatic Anki cards, but once you try to use sentences and track words, it's intractable for Anki. It's a complex graph problem even before you get to the level of word vs. form disambiguation. For dictionaries, my initial source is Oxford bi-lingual dictionaries and a few others. Re: words. Yup. This was one of the top technical challenges I needed to solve for this project to be possible. I separate each of the possible meanings and map them to a lemma or character that uniquely represents each. For example, 見ました [Japanese to see (past tense)] maps to several things simultaneously: みる - the lemma/dictionary form of the word 見る - the kanji representation of the lemma 見 - the kanji in the word みました - the non-kanji conjugation of the word Parallel texts are a feature that I hope to implement eventually, but aligning parallel texts isn't trivial, but even that (much simpler than machine translation) ends up being a many-to-many mapping. |
It seems we're on the same page regarding specific word-meanings, comprehensive linguistic information provided along with a word, and using good dictionaries.
I would suggest including at least the option of viewing the full conjugation / declension / etc. associated with a word, and not just that used in the context of the text. I don't know any Japanese (though I recognize the example character you provided from the little Chinese I've studied), but something like this full conjugation of that verb is roughly what I'd want to see for any conjugated/declined language: http://www.japaneseverbconjugator.com/VerbDetails.asp?txtVer....
I understand parsing and matching up parallel texts is difficult, but I thought NLP would help with that, and if the software fails at matching up individual words, you could default to matching up clauses or sentences, which the user could study (with flashcards or something similar) alongside the individual words taken from the dictionary.
I took a look at the app, but I see the currently-available demo is only English-Japanese, and I'm also consistently getting "API Error: Request failed with status code 412" in multiple browsers.
Exactly what parts of the app are you looking for someone else to work on?