|
|
|
|
|
by eru
1157 days ago
|
|
I think you are mixing up layers of abstraction. The network is most likely trained with something like a categorical cross entropy loss function. Those totally punish being wrong a lot more than saying "I don't know". See https://www.v7labs.com/blog/cross-entropy-loss-guide It's just that saying "I don't know" means that your model is spreading the probability of what the next token in the text stream might be over many different outcomes. A very 'uniform' probability distribution, instead of sharp prediction. That looks very different to GPT literally outputting the words "I don't know". |
|