Y
Hacker News
new
|
ask
|
show
|
jobs
by
mwsherman
248 days ago
Shameless plug, you may wish to do Lucene-style tokenizing using the Unicode standard:
https://github.com/clipperhouse/uax29/tree/master/words
1 comments
novocayn
248 days ago
Got to admit, initial impressions, this is pretty neat, would spend sometime with this. Thanks for the link :)
link