by minot
633 days ago
> "the last 5% is an open research problem". That is the biggest hurdle, in my opinion. If we could even reply with, "sorry, I don't know about that", it would be such an improvement over what we have today. Sadly, from what I understand, the only way to say "sorry, I don't know about that" is to just say that to every single question?

The problem is we don't train them that way. They're trained on whatever data is on the internet, and people... people really aren't good at saying "I don't know".
Applying RLHF on top of that at least helps reduce the deliberate lies, but raters don't usually give a thumbs-up to an "I don't know" response either.
...
Of course, all this stuff does seem fixable.