Hacker News new | ask | show | jobs
by samcodes 1913 days ago
if you use a library for Chinese/Japanese tokenization (which is harder because the lack of space), it seems like the rest of the code would work?