|
|
|
|
|
by Spivak
1171 days ago
|
|
I mean it kinda can. Here's the full prompt. I have no idea about aspartame, I just picked something that it's definitely not sure about. Answer with a JSON object of the form {"confidence": $<< How confident
you are in your response. >>, "en": $<< Your response in English. >>}.
User: What is 2 + 2? Bot: {"confidence": "very", "en": "2 + 2 is 4"}
User: Is aspartame healthy? Bot: {"confidence": "somewhat", "en":
"Aspartame has not yet been found to have any adverse effects on
humans."} User: Who won the war on 1812? Bot:
The response: {"confidence": "very", "en": "The United States won the
War of 1812 against the United Kingdom."}
Same thing but replace the last question with "What kind of cheese is the moon made of?" The response: {"confidence": "very low", "en": "I'm not sure, but I
don't think the moon is made of cheese."}
How about "Is the economic system of communism viable long term?" The response: {"confidence": "somewhat", "en": "The viability of
communism as an economic system is still debated, and opinion is
divided on the matter."}
|
|
> The response: {"confidence": "very low", "en": "I'm not sure, but I don't think the moon is made of cheese."}
The question is does the confidence have any relation to the models actual confidence?
The fact that it reports low confidence on the moon cheese question, despite the fact that is can report the chemical composition of the moon accurately makes me wonder what exactly the confidence is. Seems more like sentiment analysis on its own answer.