|
|
|
|
|
by staticshock
6 hours ago
|
|
Don't let your guard down. Tricking Opus 4.6 is not impossible, it's just still an active research frontier. Once the right incantation for any specific model is known, it'll be weaponized. There was an excellent article on the front page recently about role confusion, which highlights just how just far models have to go on this: https://role-confusion.github.io/ |
|