by Flere-Imsaho
205 days ago
Models should not have memorised whether animals are kosher to eat or not. This is information that should be retrieved from RAG or whatever. If a model responded with "I don't know the answer to that", then that would be far more useful. Is anyone actually working on models that are trained to admit when they don't know an answer?
The problem with this approach is that it does not generalize well at all out of distribution. I'm not aware of any follow-up to this, but I do think it's an interesting area of research nonetheless.
[1] https://arxiv.org/abs/2310.11511