Hacker News new | ask | show | jobs
by crawdog 2016 days ago
Looks like they are using a blanket stop word list - "I, the, and" etc. Also looks like diacritical folding has been done for accent characters.