Hacker News new | ask | show | jobs
by brianjkim21 829 days ago
yup, we designed an ensemble that considers both lexical & semantic similarity and trained on large datasets of labeled text. also building data pipelines to prevent models going stale.

max 20 docs per request for free api. no official max doc length but recently had issues with large (think book length) docs.