Y
Hacker News
new
|
ask
|
show
|
jobs
by
kostaj
24 days ago
Awesome. We do plan to human-label the 1,000 claims and then compare Lenz' performance vs the 5 models. We've done some limited internal research with 150 claims, but more are needed for statistical significance.