Y
Hacker News
new
|
ask
|
show
|
jobs
by
crawdog
2016 days ago
Looks like they are using a blanket stop word list - "I, the, and" etc. Also looks like diacritical folding has been done for accent characters.