Hacker News new | ask | show | jobs
by olihb 5313 days ago
We use aho-corasick to extract keywords from terabyte size corpus. Works really well but you have to build your matching tree before.