| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by hun3 449 days ago
	Quoting a paragraph from OP (https://www.anthropic.com/research/tracing-thoughts-language...): > Sometimes, this sort of “misfire” of the “known answer” circuit happens naturally, without us intervening, resulting in a hallucination. In our paper, we show that such misfires can occur when Claude recognizes a name but doesn't know anything else about that person. In cases like this, the “known entity” feature might still activate, and then suppress the default "don't know" feature—in this case incorrectly. Once the model has decided that it needs to answer the question, it proceeds to confabulate: to generate a plausible—but unfortunately untrue—response.

1 comments

trash_cat 448 days ago

Fun fact, "confabulation", not "hallucinating" is the correct term what LLMs actually do.

link

mystified5016 447 days ago

Fun fact, the "correct" term is the one in use. Dictionaries define language after the fact, they do not prescribe its usage in the future.

link

trash_cat 447 days ago

Confabulation means generating false memories without intent to deceive, which is what LLMs do. They can't hallucinate because they don't perceive. 'Hallucination' caught on, but it's more metaphor than precision.

link