| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by brianjkim21 829 days ago
	yup, we designed an ensemble that considers both lexical & semantic similarity and trained on large datasets of labeled text. also building data pipelines to prevent models going stale. max 20 docs per request for free api. no official max doc length but recently had issues with large (think book length) docs.