|
|
|
|
|
by Gerardo1
487 days ago
|
|
> Here’s the full prompt we used in this eval. We find it doesn’t nudge the model to hack the test environment very hard. I...find that unconvincing, both that it doesn't "nudge...very hard", and that they genuinely believe their claim. |
|