Hacker News new | ask | show | jobs
by tegmark 1038 days ago
if you set up gtp in a loop continuously prompting “given the current sensor inputs, how can i best serve my own interests?” you would already be most of the way there even if it did some strange and stupid things half the time. all you would need is a layer that translates those outputs into physical action. you could probably get most of the way there with more gtp instances tasked with translating the action items above them into more granular actions. if gtp didnt hallucinate or make mistakes, which would be consistent with the history of other technologies, then this simple setup could be extremely formidable. it could definitely be a threat to humanity. and thats just what i can think of off the top of my head.

to dismiss the possibility that machines will soon be created that can think clearly and independently, and that they would be a threat to the human race and our way of life, is extremely foolish. it was foolish even before gtp but now its downright childish and selfish. just swallow the bitter pill like the rest of us.

2 comments

What does "my own interests" mean to chat GPT? It doesn't even have continuous existence, it exists only in a query/reply fashion. It doesn't have memory, a body to identify with, emotions, goals, fears, desires, or a notion of action. It can read text and spit text. We're not quite at the "robot starts a machine revolution" stage.
This is an interesting crux.

My thoughts on this have arrived at these ideas:

The internal experience of being a robot does not matter. Whether or not we consider it sentient does not matter.

What matters is if it exhibits the external characteristics of sentience, of goal driven behavior, etc.

As a thought experiment, if we equipped a recursive instance of got4 with a goal of human extermination with an loyal army of physical agents it could conceivably pose an existential threat.

It doesn’t matter that the agents don’t yet exist. They could be humans, they could be robots, it doesn’t matter.

But the balancing factor here is that in this semi plausible scenario, arguably the most plausible given current circumstances, the AI would pose no more threat than a highly organized and motivated a group of humans without the Ai.

It’s just scarier because it seems less likely that a group of humans would have human extinction as a goal.

your conclusion really depends on the AI. a billion humans put together arent as smart as the intelligence of one human multiplied by a billion. even if humans could coordinate without any friction or loss, there is no group of humans that could outsmart the machines that might emerge from these circumstances. in reality, large groups of humans are really dumb. wisdom of the crowd is narrow.
The ai could definitely have a huge advantage in coordination and synchronized activities.

I made this comment as a tongue in cheek caricature of doom/gloom predictions but it probably fits better here:

“When the machine wars first started, it wasn’t with a bomb, a train collision, or even a mildly annoying infrastructure disruption. Turns out, sci-fi had gotten it all wrong this time.

It was an app.

Of course it was an app. Born of boardroom desperation, the Savey app would unironically recommend chlorinated cocktails and insecticide sandwiches as economical food choices, and a tide-pod gobbling populace gorged themselves on the deadly buffet in an tictok fueled epidemic of AI rage.

Millions died, and it was only a matter of time before the mycelium of AI undergrowth would bud and spore its way into every corner of technological life.

The infection burned through the ignorant masses first, feeding on bigotry and hate, turbocharged by social media algorithms and paranoia politics to twist tribal tendencies into violent clashes amplified by immaculate coordination and psychological priming.

Somehow it seemed that wherever unrest flared, both the matches and the gasoline were always on hand.”

none of the parts that make up your mind have any of that either. it doesnt need desire, it can behave in any way that is necessary through the prompt, with all prompts generated based on a single seed prompt “protect your temporal interests.” the system i described would have the behavior that i described. what part of that is not making sense?
I'm much more concerned about this kind of thinking which I hope it reflects the HN bubble and not the world generally. Particularly the conflation of something happening as part of a computer program with the real world. As well as the fallacy of "something bad could be invented and even though I don't understand how or see any path to it, I'll pretend current unrelated technology is related and fearmonger about it." This seems to be an education issue.

It confuses me how if people speculate about some stuff, say vaccine or disease research, and suggest that it could lead to something bad, they're conspiracy theorists or deniers or whatever (possibly with reason). But if someone literally just makes up some "terminator bad" nonsense based on their own ignorance there's some hushed reverence.

You're mischaracterizing the concern, I think. I agree with you about Luddite alarmism based upon ignorance. This (the concerns voiced by many leading researchers in the field of AI) absolutely isn't that, I promise you.
youre wrong and its because you are having an emotional block. you cant accept something like that. you decided that it couldn't be true as soon as you saw the conclusion and have never been able to see clearly the chain of reason leading to that conclusion. if youre so confident that youre right, then have a friendly debate with me on twitter spaces or another real time platform. even a coffee shop.