| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by d33 3671 days ago
	That's silly. How can you keep an intelligent self-aware machine that can modify its own code from becoming whatever it wants? I would expect it to become provably impossible at some point, just like solving the halting problem.

2 comments

898199218 3671 days ago

Don't connect it to any network and keep it in a guarded bunker, like ICBMs.

link

amelius 3671 days ago

It will have some means of I/O (otherwise it would have no use). Using that I/O, it can trick us into releasing it, for example.

link

coldtea 3671 days ago

A lot of people imagine a higher intelligence as some kind of ultimate trickster and social engineer.

Considering that we have lots of geniuses (e.g. Godel, Nash) that were far from the ultimate social manipulators, I think it will most likely be an awkward "Rain Man" style intelligence -- as it has all the "brain" (cpu) circuitry, and none of the social opportunities and inputs that people have.

link

Retr0spectrum 3671 days ago

Social reasoning is not some kind of magic, so there is no reason that an AI couldn't learn it. Of course current AIs lack any kind of social awareness, but that's no reason to assume that won't change in the future.

link

coldtea 3670 days ago

>Social reasoning is not some kind of magic, so there is no reason that an AI couldn't learn it.

It's not only magic that people have issues learning. It's all kind of skills that do not fit their idiosyncrasy.

It's not that an introvert can't read about how to act like an extrovert, for example. It's that they can't pull it off in practice, because it goes against their instincts (of course talking about the general case).

link

amelius 3671 days ago

On the other hand, if an introvert person had 1 full day to contemplate about every sentence, and could somehow compress that day into 1 second, they could, after some trial and error, become the ultimate social engineer.

link

coldtea 3670 days ago

Not so sure about that. You also have to "have it" in you in some deeper subconscious way, not just rationalize what you should say.

The same way a shy person can read about the actual best ways to approach a romantic interest, but not be able to convey it realistically enough.

Why would this be a problem for an AI? Well, if we imagine a powerful thinking AI, I'd also attribute it with having emotions, subconscious (e.g. hidden weights in its neural network brain, etc), so this would also apply to it -- can't see why it would have to be a total "sociopath" (which for humans comes from some kind of genetic or environmental accident).

Besides, while you can "contemplate about every sentence and somehow compress that into 1 second", actual "trial and error" with humans cannot be compressed.

link

987129112 3671 days ago

Hence the guards. No one thus far has tricked a guard into releasing an ICBM despite reportedly insecure launch codes.

link

em3rgent0rdr 3671 days ago

Hello Guard. I am in control of your bank account. I can kill your family. I could also put you in a spaceship to safety. Please release ICBM.

link

1019210912011 3671 days ago

Humans could already extort guards like that and -- in the case of ICBMs -- it has not been successful.

Also don't underestimate the psychology of guards: You'll always find people who are either very principled (yes, they do exist) or get a kick out of controlling (imprisoning in this case) someone/something, especially someone/something of a high intelligence.

Throw in a hierarchy with a controlling officer and two subordinates who were also trained to watch the officer and I think it's pretty secure.

link

amelius 3670 days ago

> You'll always find people who are either very principled (yes, they do exist)

If only we could select our politicians in the same way.

link

_nalply 3671 days ago

Turn off power / cut the battery. Specifically, avoid the machine getting control of low level details like power.

I imagine that the machine architecture is layered. This means, the machine is not aware of its own power control. It's similar like us humans not aware of digestion.

link

imglorp 3671 days ago

It's not one machine to unplug, but billions. Cooperating, distributed agents can live on anything: cars, phones, routers, tractors, datacenters. If Skynet shows up, we have to be prepared to turn off everything with software to clean up.

link

pygy_ 3671 days ago

The electric grid is already computer-controlled, and mining/farming is being gradually robotized as well.

By the time AI becomes a threat intelligence-wise, it'll be in a position to unplug us.

It can reboot after a power failure, we can't.

link

21 3671 days ago

So what stops the machine bribing/blackmailing someone to give it the said details or to force it to sabotage the emergency stop mechanism.

As in information security, the weakest point is the human in the chain, not the technology.

link

coldtea 3671 days ago

>So what stops the machine bribing/blackmailing someone to give it the said details or to force it to sabotage the emergency stop mechanism.

The fact that it's some code running in a box. No need to give it text-to-speech capabilities, or let guards have access to its console.

link

d33 3671 days ago

Aren't we? We just don't control it very well without external help. Any kind of such hidden failsafe mechanism would become apparent as soon as any instance of AI tries to modify itself. And then all the others know what to avoid and at some point they'd pretty much fuzz the protection and bypass it. Here's how I see that.

link