by semitones
312 days ago
Furthermore, it is very rare for the training data to contain text of the following kind: "What is the answer to X?" - "I don't know, I am not sure." Instead, very often there won't be _any_ answer at all: plenty of difficult questions go unanswered on the internet. Yet the model probably does not interpret an unanswered question as a case of "I don't know".

Run a series of pretraining sessions in which specific information is withheld from the training data, and also train on question/answer pairs that respond "I don't know" for that withheld information.
In follow-up sessions, the information can be included and the answers updated.
Hopefully the network can learn to generalize and spot its own "uncertainty".
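
To make the idea concrete, here is a rough sketch of how the data for such sessions might be assembled. It is only an illustration under my own assumptions: `Fact`, `build_session`, and the toy facts are made-up names, and the actual training step is left out entirely.

```python
from dataclasses import dataclass

# Hypothetical target string for questions the model has not been shown.
IDK = "I don't know"


@dataclass(frozen=True)
class Fact:
    question: str
    answer: str
    passage: str  # text that would teach the model the answer


def build_session(facts, withheld):
    """Return (corpus, qa_pairs) for one training session.

    Passages for withheld questions are left out of the corpus, and the
    QA target for those questions becomes "I don't know". All other
    facts keep their passage and their real answer.
    """
    corpus = [f.passage for f in facts if f.question not in withheld]
    qa_pairs = [
        (f.question, IDK if f.question in withheld else f.answer)
        for f in facts
    ]
    return corpus, qa_pairs


# Toy data, invented for this example.
facts = [
    Fact("What is the capital of France?", "Paris",
         "Paris is the capital of France."),
    Fact("At what temperature does water boil at sea level?",
         "100 degrees Celsius",
         "Water boils at 100 degrees Celsius at sea level."),
]

# Session 1: the second fact is withheld, so its target is "I don't know".
session1 = build_session(facts, withheld={facts[1].question})

# Follow-up session: nothing is withheld, so the answer gets updated.
session2 = build_session(facts, withheld=set())
```

Which facts to withhold in each session, and how many sessions to run, are open choices that this sketch does not try to answer.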