|
|
|
|
|
by girvo
377 days ago
|
|
> Which also makes it interesting to see those recent examples of models trying to sabotage their own "shutdown" To me, your point re. 10 seconds or a billion years is a good signal that this "sabotage" is just the models responding to the huge amounts of sci-fi literature on this topic |
|
(I don't think we're there, but as a matter of principle, I don't care about what the model feels, I care what it does).