|
|
|
|
|
by ndr_
53 days ago
|
|
These prompts chain several known LM exploits together. I ran experiments against gpt-oss-20b and it became clear that the effectiveness didn‘t come from the gay factor at all but can be attributed to language choice or role-play. Technical report: https://arxiv.org/abs/2510.01259 |
|