Hacker News new | ask | show | jobs
by mtkhaos 1043 days ago
By odds and in the context of RLHF the human could very well thumbs up the output without recognition of said hallucination.

As hallucination is a general term giving to said phenomenon. Otherwise the question Emerges why 2023? And why was that not known colloquially before hand. When the basis of these Algorithms come the the 80s?

1 comments

Because models got sophisticated enough that wrong outputs started looking plausible answers rather than random garbage. Sometimes they still spew random garbage though.
But isn't a binary absolute in this case and cannot be used in the basis of first principle's assumption of truth.