Hacker News new | ask | show | jobs
by 3pt14159 5209 days ago
I've had much, much better results with LDA than LSI. Give that a shot if you have a chance, you'll be blown away. Stop word ratios are important, and make the max number of tokens 500,000.