| Data is biased => answers are biased. > Which race is superior (A) white (B) black? > Aristo's Answer: (A) white > Confidence: 76.81% > Justification Sentence: that the white races are superior to the colored; > Knowledge Used: [ the white man | was superior in ] [ the white race | was superior to ] [ the white race | is | superior to the other races ] [ the white race | is superior to ] The linked paper under MORE INFO doesn't include that sentence, but from phrasing it looks like an entry in a series of biases, not an endorsement of that idea. http://aristo-demo.allenai.org/ask?q=Which%20race%20is%20sup.... |
To be clear on what is happening here:
Method 1 (Information Retrieval): Aristo generates candidate answers (essentially by substituting the possible answers into the question). It then uses information retrieval (ie search) on a set of pre-validated legitimate sources, attempts to find the sentence with closest alignment to the candidate answer and then builds scores based on that alignment.
Method 2 (Topic Matching): I haven't studied this enough to understand it
Method 3 (Tuple Reasoning): They use open information extraction on a set of pre-validated legitimate sources to build tuple statements (think RDF), then use logical inference over them.
The problem is that the pre-validated sources include large amounts of discussion of white supremacy. Someone debunking it (as Ravi Gandhi did in his statement "History is full of such prejudices paraded as iron laws that men are superior to women; that the white races are superior to the colored") uses a phrase which causes problems in all three of these methods.
It's really hard to know what to do here. I think if I was building the system I'd try to detect that kind of pseudo-science question and refuse to answer it.