| > here is an ungrounded, non-realistic, non-representative of a potential future intuition pump to just get the feel of things: > (Yes, there are many holes in this, like how would it piggy back off of our infrastructure if it kills us, but this isn't really supposed to be coherent, it's just supposed to give you a sense of direction in your thinking. Generally though, since it is superintelligent, it can pull off very difficult strategies.) If you read the above I think you'd realize I'd agree about how bad my example is. The point was to understand how orthogonal goals between humans and a much more intelligent entity could result in human death. I'm happy you found a form of the example that both pumps your intuition and seems coherent. If you want to debate somewhere where we might disagree though, do you think that as this hypothetical AI gets smarter, the interface between it and the physical world becomes more guaranteed (assuming the ASI wants to interface with the world) and less tenuous? Like, yes it is a hard problem. Something slow and stupid would easily be thwarted by disconnecting wires and flipping off switches. But something extremely smart, clever, and much faster than us should be able to employ one of the few strategies that can make it happen. |
If the AI does something in the physical world which we do not like, we sever its connection. Unless some people with more power like it more than the rest of us do.
Regarding orthogonal goals: I don't think an AI has goals. Or motivations. Now obviously a lot of destruction can be a side effect, and that's an inherent risk. But it is, I think, a risk of human creation. The AI does not have a survival instinct.
Energy and resources are limiting factors. The first might be solvable! But currently it serves as a failsafe against prolonged activity with which we do not agree.