Hacker News
by devilsbabe 1813 days ago
Very cool work! Paradoxically, I've been slacking off on practicing Japanese diligently ever since I moved to Tokyo. I've been meaning to get back into a good routine so I'll give jpdb a shot.

I'm curious how you did this: "We have 16785 prebuilt decks with vocabulary from 1124 different anime waiting for you." Did you write a script that calls subs2srs?

Well, essentially yes, it is based on text analysis, but it's a lot more complex than "a script that calls subs2srs". (:

(My whole codebase is over 100k lines of code, all written by me.)

I have a unified morphological analysis engine that I use for every type of media that I have in my database, and I use it to generate stats and vocabulary lists.

I didn't mean to imply that what you're doing is simple; I was just curious.

I'll be honest, I have no idea what a unified morphological analysis engine is. Something to look into tonight.

I didn't take it that way, so don't worry. (:

Sorry, I might have gotten carried away with all of the fancy words. (: The "morphological" part comes from linguistics (see the "Morphology (linguistics)" article on Wikipedia). By "unified" I meant that I apply it to every kind of text, and an "engine" is just another word for "software" (think "game engine" - a reusable piece of software that can be used for many things in a single domain).
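
For anyone else wondering what that looks like in practice, here is a toy sketch of the general idea. It is not jpdb's code: the tiny `LEXICON`, the `analyze` and `vocabulary` functions, and the sample sentences are all made up for illustration, and a real analyzer would need a full dictionary plus proper handling of ambiguity and conjugation. The point is only that one analysis function is reused for every kind of text, and the vocabulary list with its counts falls out of the result.

```python
from collections import Counter

# Hypothetical toy lexicon: surface form -> dictionary (base) form.
# A real engine would use a full dictionary, not a hand-written table.
LEXICON = {
    "食べた": "食べる",
    "食べる": "食べる",
    "見た": "見る",
    "猫": "猫",
    "魚": "魚",
    "が": "が",
    "を": "を",
}
MAX_LEN = max(len(surface) for surface in LEXICON)


def analyze(text):
    """Split text by greedy longest match and return base forms."""
    lemmas = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, then shorter ones.
        for n in range(min(MAX_LEN, len(text) - i), 0, -1):
            chunk = text[i:i + n]
            if chunk in LEXICON:
                lemmas.append(LEXICON[chunk])
                i += n
                break
        else:
            i += 1  # unknown character (e.g. punctuation): skip it
    return lemmas


def vocabulary(lines):
    """Count base forms across all lines of one piece of media."""
    counts = Counter()
    for line in lines:
        counts.update(analyze(line))
    return counts


# The same analyze() is applied to every kind of text ("unified").
subtitle_vocab = vocabulary(["猫が魚を食べた。", "猫が魚を見た。"])
novel_vocab = vocabulary(["魚を食べる猫"])
```

Because "食べた" and "食べる" both map to the same base form, they count as one vocabulary item no matter which medium they came from, which is what makes per-show or per-book word lists comparable.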