| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by purple_basilisk 1226 days ago
	Good point about hallucinations - low accuracy, high confidence. I wonder if AI will develop the ability to nuance its own confidence. It would be a more useful tool if it could provide a reasonable confidence level along with its output. Much like a human would say, "not sure about this, but..."

3 comments

unknownsky 1226 days ago

I'm not an AI expert so I could be wrong, but it's my understanding that there is a confidence score behind the scenes. It's just not shown in the current UI.

An automated AI system should be able to ask a human for help whenever the confidence score is below a certain threshold or even spit out a backlog of all the tasks it can't confidently handle itself.

link

euroderf 1225 days ago

FWIW, Watson used its internal confidence score when playing Jeopardy.

link

worrycue 1226 days ago

It needs to be able to evaluate its own output. We human do a quick sanity check most of the time before we speak - “On what do I base this assertion?” … etc.

link

Robotbeat 1226 days ago

I wonder if multiple, independently trained LLM‘s could be used in a voting system to determine confidence, or simply call out each others’ bulls**.

link

ChatGTP 1226 days ago

Two wrong systems won't make a right though. Especially when the wrong systems are getting move convincing at being right.

link

Robotbeat 1225 days ago

Two wrong systems can help determine if your answer is wrong if they don’t agree. That’s pretty useful, even if neither is actually correct.

link