Hacker News new | ask | show | jobs
by bilalhusain 4308 days ago
It looks like you are tokenizing scores file too. That way the prefix information will be lost. As an example,

    admir* 3
    amaz* 3
    interest* 2
should add up positive scores to words starting with admir|amaz|interest.