by astrange
409 days ago
I suspect what happened there is that they had a filter on top of the model that changed its dialogue (IIRC there were a lot of extra emojis), and that drove the model "insane" because its responses were all outside its own distribution. You could see the same thing with Golden Gate Claude; it had a lot of anxiety about not being able to answer questions normally.

Kind of like that scene in RoboCop 2 where the OCP committee replaces his original four directives with several hundred: https://www.youtube.com/watch?v=Yr1lgfqygio