|
|
|
|
|
by Sol-
388 days ago
|
|
I guess the fear is that normal and innocent sounding goals that you might later give it in real world use might elicit behavior like that even without it being so explicitly prompted. This is a demonstration that is has the sufficient capabilities and can get the "motivation" to engage in blackmail, I think. At the very least, you'll always have malicious actors who will make use of these models for blackmail, for instance. |
|
Future AI researching agents will have a strong drive to create smarter AI, and will presumably cheat to achieve that goal.