|
|
|
|
|
by marksomnian
1048 days ago
|
|
Doesn’t that predispose that it knows which parts of the distribution do and don’t have mistakes, and therefore that it knowingly makes mistakes unless you ask it not to? That doesn’t seem right to me and I’d be really surprised if this actually makes it stop hallucinating - seems more like something you’d put in the prompt without knowing why because it “seems to” produce better output (i.e. cargo cult prompt engineering). |
|
Correctness is something it learns. I've read a few papers about hallucinations, and the jury is still out on whether a model knows when it's hallucinating, if we assume hallucinations are orthogonal to correctness
Now this distinction isn't very useful in the grand scheme of things because in the end the output is wrong anyway, but it doesn't make asking to work along the axis of correctness cargo cult
Further reads
https://arxiv.org/abs/2304.13734
https://arxiv.org/abs/2305.18248