by stakhanov 2726 days ago

This is a bit reminiscent of the "Recognizing Textual Entailment (RTE) Challenge", which ran under the "Text Analysis Conference" umbrella and was hosted by NIST until it was discontinued a few years back. An interesting insight from a qualitative analysis of how submitted answers deviated from the gold standard answers is that the deviations can be explained surprisingly well by: random choice minus publication bias. See here:

http://richard.bergmair.eu/pub/thesis.pdf [page 43]

That's what the number "53.84%" sounds like to me.
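
To make that intuition concrete, here is a minimal sketch of one possible reading of it. This is my own toy model, not the analysis from the thesis: suppose many systems guess at random on a two-way task and only the best-scoring one gets reported. The item count, the number of systems, and the function names are all assumptions picked for illustration.

```python
import random


def best_of_random_guessers(n_items, n_systems, rng):
    """Highest accuracy among n_systems systems that each flip a fair coin per item."""
    best = 0.0
    for _ in range(n_systems):
        # Each guess is correct with probability 0.5 on a balanced two-way task.
        correct = sum(rng.random() < 0.5 for _ in range(n_items))
        best = max(best, correct / n_items)
    return best


def mean_best(n_items, n_systems, trials, seed=0):
    """Average the best-of-n_systems accuracy over many simulated evaluations."""
    rng = random.Random(seed)
    total = sum(best_of_random_guessers(n_items, n_systems, rng) for _ in range(trials))
    return total / trials


if __name__ == "__main__":
    # Hypothetical sizes: 800 test items, 1 versus 20 competing systems.
    print("single random system:", mean_best(800, 1, 200))
    print("best of 20 random systems:", mean_best(800, 20, 200))
```

A single guesser averages out to about 50%, while the best of several guessers lands a few points above that purely through selection. That is the sense in which a score slightly above chance need not indicate any real signal.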