by stakhanov
2726 days ago
This is a bit reminiscent of the "Recognizing Textual Entailment Challenge (RTE)", which ran under the "Text Analysis Conference" umbrella and was hosted by NIST until it was discontinued a few years back. An interesting insight from a qualitative analysis of how submitted answers deviated from the gold standard answers is that the deviations can be explained surprisingly well by: random choice minus publication bias. See here: http://richard.bergmair.eu/pub/thesis.pdf [page 43]. That's what the number "53.84%" sounds like to me.