| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by ldmosquera 468 days ago

But it's a continuum, not a hard cutoff. They start hallucinating as soon as you query something they haven't learned verbatim, and they hallucinate/extrapolate sucessfully up to a point, beyond which they start bullshitting, maybe up to a further point where they start saying "I don't know".

The key question is where the boundaries are. Maybe they should be part of the response - a per sentence or per paragraph "confidence scale" that signals how hard they extrapolated from their trained space (I know transformers work per token, but sentence/paragraph would be better human UX).

Of course, if they were trained on garbage input, that would only tell you how accurately they sticked to the garbage. But it would still be invaluable instrumentation for the end user, not to mention for the API provider. They could look at high demand subjects with low confidence answers and prioritize that for further training.