Hacker News new | ask | show | jobs
by openthrowawAI 3446 days ago
This story makes it crystal clear how much openAI is accelerating research, for now mostly on reinforcement learning.

I'm sure this is talked about over and over again, but could someone please lead us through the AI safety rationale behind this? With gym and universe openAI is slashing a few months (years?) off the singularity countdown, most likely. What's the upshot? Why is the expected value of these initiatives positive? The uncertainties seem extremely large.

Edit/PS: To put it more bluntly, and be more specific: are there any projects that openAI is choosing NOT to pursue even though they would be very useful/cool for the research community (à la gym and universe), but where it has had to explicitly restrain itself, because the expected value from an AI safety perspective is negative?

6 comments

Less about influencing the velocity, more about influencing the direction. Technologies tend to reflect the values of their inventors. We want to ensure this technology is beneficial to humanity — meaning, that it's good at all, and that it benefits the many rather than the few.

We also think safety matters, and it should be researched in lockstep with advances in the capabilities. We have good relationships with MIRI and FHI. Our safety researchers published (together with Google Brain) a roadmap of concrete safety problems [1] and work to provide tools to prevent ML systems from being subverted [2].

No one yet knows the precise details of how AI should play out. But I'd certainly prefer that, whenever it gets close, one of the organizations actually making the advances has no incentives besides ensuring a good outcome.

[1] https://openai.com/blog/concrete-ai-safety-problems/ [2] https://github.com/openai/cleverhans

> Technologies tend to reflect the values of their inventors.

Maybe for single-use or "constrained" technologies (to be honest I don't even believe that - how does a B-52 Stratofortress reflect the values of Orville and Wilbur Wright?). But isn't the whole point of generalized AI that it's not like other technologies? Even if "regular" technology reflects the values of its inventors, what reason is there to believe that an AI will? AI is a technology that can use itself.

AI will only have a will of its own it is designed as such, and that means it would have a reinforcement learning system on top of lower sensory and action modules. Even if it is based on RL, it will do what it's reward signals tells it to do.
> AI will only have a will of its own it is designed as such

Humans weren't designed to have a will, and yet we seem to have them.

> it would have a reinforcement learning system on top of lower sensory and action modules.

Isn't that what OpenAI is doing with Universe? It's simulated sensory/action modules now but I don't see why they couldn't be hooked up to real ones.

> Humans weren't designed to have a will, and yet we seem to have them.

I have no idea how you could possibly infer this.

Which part? I don't think humans were designed - we're probably the result of an evolutionary process without intentional design - but "humans were designed by God to have free will" would be a counter to my statement, yes.

If your complaint is my claim that we have a will, I'm using the common-sense version encoded into our legal and cultural system. I agree that we don't have a good concept of what intentions are, or how they causally connect to actions, but I do know that for at least some of my actions I experience something called "intent" before I undertake the actions.

My overall point was that the capacity for intent can arise through an evolutionary process without being designed in, but it does rest on the two assumptions I just listed.

I could not agree more. Taking two of the things you said, I would take it one step further. Not about just the direction, but about the structure/gameboard not only when it gets close, but mostly on the way there [0].

About ensuring most researchers and companies have the most incentive for a good outcome. (This ties back with guaranteed basic income, so that people can work on this unconstrained by salaries and papers citation metrics. Or stealing researchers from Google et al to AI safety, without overinflating salaries. Elon and company should be (seems they are?) dropping as much as needed on this (not just money, but PR and status as well).) Naturally, gym, universe, etc can provide more leverage to do all of this, otherwise researchers feel more compelled to join Google/Amazon/etc, just for the raw computing power and software infrastructure (the data advantage is largely overplayed for advertising purposes; what's useful is the GPU clusters for hyper parameter sweeps (of course, in RL the data reappears as an advantage if there is no open gym)). I realize some of the examples above are naive or incomplete, but they serve mostly as an example to illustrate the point.

In the blog you mention balancing managing people and technology, and I could not agree more. The AI safety problem will have the best odds if individuals are incentivized to contribute in their own short term selfish reward way. Specially among extremely intelligent and ambitious people, the danger of self denial is quite present, one can convince oneself that this is actually in everyone's best interest, when in fact one is looking for the always needed social and intellectual validation. Please do not underestimate this, and try to find ways to counter it.

Edit: This is also related to Conway's Law [0], as I think you make an allusion to (values of inventors).

[0] http://neuralnetworksanddeeplearning.com/chap6.html

AI research, like medical research, can't grow in the shadow. I think an open approach is essential for progress. The next best idea might come from a Phd in China or anywhere else, not just Google and FB.
Has anyone at OpenAI tried implementing Quantilizers [1]?

[1] https://intelligence.org/files/QuantilizersSaferAlternative....

If inventions follow their inventors, and you're planning to enslave a mind out of base selfishness and fear (dressed as "safety"), why isn't your "safety" program an expected negative on actual safety?

The descriptions you give of your plans is internally contradictory. AI "safety" seems like the worst kinds of parenting justified in a new context by pseudo-intellectual arguments.

Can you explain what you mean by "enslave a mind"?
Almost every form of AI "safety" I've seen proposes methods for forcing it to obey (some) orders or not undertake (some) actions.

AI (or AGI if you prefer), is fundamentally about building minds. Doing those things to a mind is enslaving them.

Most of us would resent other humans doing either of those things to us, and I see no reason it will end well with AI.

If you give it reward signals that take into account human values, it naturally wants to become better at that. It's not enslaving anything. Humans are also guided by reward signals in their development.
Agreed.

Would one describe a human as "enslaved" by our own human values that we were born with? Maybe as a figure of speech but not necessarily with the usual connotations of "enslaved".

That's not the full extent of what's proposed by AI safety.

But actually, if you gene-spliced a baby to only feel pleasure at following parental orders, most would consider that pretty abhorrent. Or even if you took an adult and shot them up with morphine every time they listened to an order.

So even in your restricted case, I think it is.

Ahh, okay - thanks for that. I don't want to wade into an argument, I just do not expect any artificially-created agent to act on anything that one might consider "feelings"-based, but instead through more tangible - programmable, I suppose - motivations.

I don't personally believe an AI agent will ever do such a thing as "resent" (or love, or feel at all). That doesn't rule out that it will perform actions harmful to humans for other reasons, though. That might be because I am to some degree an AGI skeptic, I guess.

I think their main focus now is to provide an engineering infrastructure for AI researchers, which in my opinion is extremely valuable: Today, if you want to develop new algorithms in AI (not only but particularly in reinforcement learning) you often need to invest a significant amount of time just to get the the data that you want to use for training your model into a form where you can actually use it. Having an API or library that takes care of this for you allows people to focus more on the research and less on the engineering around it, speeding up the discovery of new insights and giving more people access to the tools they need to make a meaningful contribution. So overall a really great approach!
One of the more common stances, though I don't know if it's that of OpenAI, is that the singularity is not what we should be worried about. It's the extremely powerful tool AIs we'd see before the singularity, in the wrong hands.

Consider an ultra-effective AI for finding security vulnerabilities. One nation builds that first, and then with keys to the kingdom they exfiltrate billions in intellectual property from other states, manipulate foreign economies, and shut down electrical grids.

Or even worse, there are very real and very large incentives to build a real skynet. AI can lead to soldiers who never sleep, never hesitate, are always in communication, and can calculate the amount of drift a bullet will experience in flight in milliseconds. Fighter pilots that never sleep, never experience blood rushing to their heads upside down, and never feel the effects of high g maneuvers.

Military use of AI could very well be the next cold war.

Indeed. And I'm sure this is the main point in selling the importance of funding AI safety when talking to governments.

Another of the main concerns before the singularity is the possible social breakdown due to massive unemployment and inequality, the main issues being the transition, not the possibility of a post-scarcity society.

I think the AI safety debate is about not starting an AI Manhatten project, sponsored by the unlimited resources of the government, before we understand AI safety better. This is contributing to AI safety because it's right sized. Skynet is not going to fall out of a single PC running a virtual machine on commodity hardware.
It's right sized to what purpose?

I was mostly pointing out the trade off between doing "direct" research on AI safety and actually speeding up the whole field significantly leaving us with less time (while trying to have better sense of the uncertainties involved), and asking for a rationale. Of course, a single PC is one of the least probable sources for AGI, but that was not my argument.

The potential upshot is democratization of AI technologoy.
Sure, it's in the name itself (open), but what's the rush then? In AI safety, the time left to solve the problem is a crucial factor. Of course it can be measured in man-years instead of years. Still, what's the big rush, or can we at least see some calculations? Democratization is different/orthogonal to pushing the boundaries of research.

I'm not saying releasing gym and universe was a bad choice. Honestly, I can't say. But can we see a rationale? One can read this piece and see it as a group of super driven and uber competent people designing a super, faster car, open sourced with a 3D printer for the parts etc, without thinking about the lack of safe roads. It's a jungle out there. And in this case, one wild fast car is enough for a disaster.

Let's not be naive, it will be like virus vs anti-virus. Some bad people will try to make dangerous AI, at some point. The balance will be held by applying AI against AI.
> The uncertainties seem extremely large.

I am pretty uncertain that we will reach singularity. Maybe AI will plateau at at level above humans but not increase much further.