Hacker News new | ask | show | jobs
by Andi 384 days ago
A compound word splitter in JS that splits up words of different (esp. Germanic) languages down to simplexes reliably based on a dictionary, including Segmenter API and SQLite FTS connector. Works very reliably by internally using statistics for known compound words, too, and a stemmer.
1 comments

Sounds neat, got a repo?
May take some weeks.