Hacker News new | ask | show | jobs
by lautis 5016 days ago
There are libraries to handle stemming and Unicode equivalence which are easy to add into this kind of boolean search. If ranking documents would definitely mean that some other approach, e.g. vector space model, should be used.

https://github.com/aurelian/ruby-stemmer http://unicode-utils.rubyforge.org