|
|
|
|
|
by cpitman
439 days ago
|
|
I was looking for a more direct measure of this, how often a model "leaked" private state into public state. In a game like this you probably want to sometimes share secrets, but if it happens constantly I would suspect the model struggles to differentiate. I occasionally try to ask a model to tell a story and give it a hidden motivation of a character, and so far the results are almost always the model just straight out saying the secret. |
|