by mlyle
474 days ago
I don't know what you mean, then. They tried lots of fine-tuning variants. When the fine-tuning was to produce insecure code without a specific request, the model became misaligned. Similar fine-tuning (generating secure code, generating insecure code only when requested, or fine-tuning to accept misaligned requests) didn't have this effect.

Producing insecure code isn't misalignment. You told the model to do that.