	by kostaj 24 days ago
	Awesome. We do plan to human-label the 1,000 claims and then compare Lenz's performance against the 5 models. We've done some limited internal research with 150 claims, but that sample is too small; we need more claims for statistical significance.
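
	To illustrate why 150 claims is thin, here is a rough sketch of how much the uncertainty on a measured accuracy shrinks when going from 150 to 1,000 labeled claims. It uses a Wilson score interval, which is my choice for the example and not necessarily what the team uses. The 80% accuracy figure is purely hypothetical, as is the function name `wilson_interval`.

```python
import math


def wilson_interval(correct, n, z=1.96):
    """Wilson score interval for a binomial proportion.

    correct: number of claims judged correctly (hypothetical input)
    n:       number of human-labeled claims
    z:       normal quantile; 1.96 corresponds to roughly 95% confidence
    """
    if n <= 0:
        raise ValueError("n must be positive")
    p = correct / n
    denom = 1 + z * z / n
    center = (p + z * z / (2 * n)) / denom
    half = z * math.sqrt(p * (1 - p) / n + z * z / (4 * n * n)) / denom
    return center - half, center + half


# ASSUMPTION: 80% accuracy is an invented figure, not a reported result.
assumed_accuracy = 0.80

for n in (150, 1000):
    low, high = wilson_interval(round(assumed_accuracy * n), n)
    print(f"n={n}: 95% interval {low:.3f} to {high:.3f} (width {high - low:.3f})")
```

	With the same underlying accuracy, the interval at 1,000 claims is well under half as wide as at 150, which is what makes differences between models easier to tell apart from noise.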