by janalsncm 393 days ago
Vector-based approaches either don't handle out-of-vocabulary (OOV) terms at all or handle them poorly, depending on the implementation. If you limit yourself to alphanumeric trigrams, for example, you can technically cover every term, but the quality of that coverage can be bad, depending on the training data.
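
To make that concrete, here's a rough sketch of the two failure modes. Everything in it is made up for illustration: the toy training terms, the `#` boundary padding, and the helper names (`char_trigrams`, `word_lookup`, `trigram_coverage`). It's not how any particular embedding library does it, just the general shape of the problem.

```python
import re


def char_trigrams(term):
    """Lowercase, drop non-alphanumerics, pad with '#', split into trigrams.

    The '#' padding is an assumption of this sketch, not part of the comment.
    """
    cleaned = re.sub(r"[^a-z0-9]", "", term.lower())
    padded = f"#{cleaned}#"
    return [padded[i:i + 3] for i in range(len(padded) - 2)]


def word_lookup(term, vocab):
    """Word-level vocabulary: an unseen term simply has no entry."""
    return vocab.get(term.lower())


def trigram_coverage(term, seen_trigrams):
    """Fraction of the term's trigrams that occurred in the training terms."""
    grams = char_trigrams(term)
    if not grams:
        return 0.0
    return sum(g in seen_trigrams for g in grams) / len(grams)


# Hypothetical toy "training data".
training_terms = ["search", "vector", "index", "ranking"]
word_vocab = {t: i for i, t in enumerate(training_terms)}
seen = {g for t in training_terms for g in char_trigrams(t)}

for query in ["searching", "xk7-qz9"]:
    print(query, word_lookup(query, word_vocab), trigram_coverage(query, seen))
```

With a word-level vocabulary both queries come back as `None`, so they aren't handled at all. With trigrams, both queries always decompose into something, but "searching" mostly reuses trigrams seen in training while a part number like "xk7-qz9" shares none, so whatever vector you build for it is effectively untrained.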