| > model for the truth? Without sensing/experiencing the world, there is no truth. The only truth we can ever truly know, is the present moment. Even our memories of things that we “know” that happened, we perceive them in the now. Language doesn’t have a truth. You can make up anything you want with language. So the only “truth” you could teach an LLM, is your own description of it. But these LLMs are trained on thousands or even million different versions of “truth”. Which is the correct one? |
https://arxiv.org/abs/2212.03827
Another approach: a model can learn the distribution of a fact in its training set. Is the fact present at all, how many times does it appear, and is the distribution unimodal (agreement) or multimodal (disagreement, or just high variance)? Knowing this, a model can adjust its responses accordingly, for example by presenting multiple possibilities, or by declining to answer rather than hallucinating when there is no information.
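To make the idea concrete, here is a toy sketch in Python. It is not how an LLM actually does this: a real model would have to learn these statistics implicitly, whereas this sketch assumes we can directly list every value the training set gives for one fact. The function names and both thresholds (`min_count`, `agreement_share`) are made up for illustration.

```python
from collections import Counter


def summarize_claims(values, min_count=3, agreement_share=0.8):
    """Classify how one fact is represented in a toy corpus.

    `values` holds every answer the corpus gives for the same question.
    Both thresholds are arbitrary assumptions, not tuned numbers.
    """
    counts = Counter(values)
    total = sum(counts.values())

    # Too few mentions: treat the fact as not known.
    if total < min_count:
        return {"status": "unknown", "candidates": []}

    ranked = counts.most_common()
    top_value, top_count = ranked[0]

    # One dominant answer: roughly a unimodal distribution (agreement).
    if top_count / total >= agreement_share:
        return {"status": "agreement", "candidates": [top_value]}

    # Several competing answers: multimodal (disagreement or high variance).
    return {"status": "disagreement", "candidates": [v for v, _ in ranked]}


def respond(values):
    """Pick a response style based on the distribution summary."""
    summary = summarize_claims(values)
    if summary["status"] == "unknown":
        return "I don't have enough information to answer."
    if summary["status"] == "agreement":
        return f"Most sources say: {summary['candidates'][0]}"
    return "Sources disagree; possibilities: " + ", ".join(summary["candidates"])
```

Calling `respond(["Paris"] * 9 + ["Lyon"])` takes the agreement branch, `respond(["A", "B", "A", "B", "C"])` lists all three candidates, and `respond(["x"])` declines to answer instead of guessing.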