Hacker News new | ask | show | jobs
by Animats 1330 days ago
This is a common sentiment, and pundits have been making similar remarks for decades. This author writes "Sixty years later, however, high-level reasoning and thought remain elusive."

That's the wrong problem with AI. The trouble with AI is that it still sucks at manipulation in unstructured situations and at "common sense". Common sense can usefully be defined as getting through the next 30 seconds of life without a major screwup. At, at least, the competence level of the average squirrel. This is why robots are so limited.

If we could build a decent squirrel brain, something "higher level" could give it tasks to do. That would be enough to handle many basic jobs in unstructured spaces, such as store stocking, janitorial, and such. It's not the "high level reasoning" that's the problem. It's the low-level stuff.

A squirrel has around 10 million neurons. Even if neurons are complicated [1], somebody ought to be able to build something with 10 million of them. Current hardware is easily up to the task.

The AI field is fundamentally missing something. I don't know what it is. I took a few shots at this problem back in the 1990s and got nowhere. Others have beaten their head against the wall on this. The Rethink Robotics failure is a notable example.

The real surprise to me is how much progress has been made on vision without manipulation improving much. I'd expected that real-world object recognition would lead to much better manipulation, but it didn't. Even Amazon warehouse bin-picking isn't fully automated yet. Nor is phone manufacturing. Google had a big collection of robots trying to machine-learn basic manual tasks, and they failed at that.

That's the real problem.

[1] https://www.sciencedirect.com/science/article/pii/S089662732...

8 comments

> Current hardware is easily up to the task.

I don't think so. If you want to model a single synapse in full to capture all effects that might lead to "learning", you have a system of ordinary differential equations. Solving that is very hard, and solving that for 10 million neurons is impossible.

On current hardware can only implement but a poor caricature of a real neuron.

While this is true, the complexity perspective misses something more fundamental.

1) Our brains, and moreso those of animals, come with a really good pretraining at birth. This is collective genetic knowledge of millions of generations distilled into your brain.

2) Our brains have a lot of sensors and actuators to interact with the world. We only learn by reading as adults when our brains can already do the synesthesia of translating words into thought. But even as adults, most of us learn better if we do something, write something, engage in dialog, instead of passively listening, reading, or watching.

Passive data can never replicate the rich environment our brains grow up in.

While true, there’s a relatively small upper bound on how many bits of information are in this pre-training. Specifically, in the form of how much information is contained in DNA, which is only a couple gigabytes.
Stable diffusion model is around 4 gigabytes, inside that 4 gigabytes you have understanding of the whole english language model and mapping to billions of objects, people, concepts etc capable of generating from just a single sentence almost any picture in any style you imagine. Seems like a few gigabytes can hold a lot of information.
That problem has been overcome.[1]

This is a neat result. This research started with the differential equation model of a neuron and tried to train various neural nets to get the same result to within 99%. They succeeded. Worst case took an 8-layer net with 256 elements per layer. See Fig. 4. So, 10 billion elements for a squirrel. Not that big by current standards.

It's not clear that a model which tracks the biological neuron that accurately is needed. They discuss simpler models that are almost as good.

Low-end mammal brains should be buildable right now. It's not a hardware limitation.

[1] https://www.sciencedirect.com/science/article/pii/S089662732...

> On current hardware can only implement but a poor caricature of a real neuron.

We don't need a complete physiological model for it to be useful. We don't need a perfectly accurate silicon-based mirror of a mammalian brain to outsmart ours on every task we do (and many we don't even realize we could). The challenge will be to coexist and cooperate with these completely alien intelligences that share almost nothing with ours.

I agree with your comment (but the OP point was not about which approximate model might still work)
I think this requires the assumption that modeling the complexity of biological synapses is required for general intelligence, when we don't know that to be the case. Personally, I believe that it's not a requirement at all and that the first AGI will be strikingly non-neuromorphic. Just my two cents though.
Real neurons are far slower (interaction is chemical vs electrical) and far less precise ( iirc something comparable to 4 - 7x less precise than 32bit float) than physical neurons.
Biological brains have had a few billion years to optimize. Over the past decade or two, it's been increasingly apparent that the structure and algorithms that govern a particular neural net's behaviour are extremely important to its efficacy.

We likely have a very warped view of what intelligence is, because the most prominent examples of it have been aggressively honed over an extremely long period of time to be good at tasks crucial to their survival, such as effectively navigating a 3D environment. We consider art to be a difficult and complex task, and making a sandwich to be a simple one, but that's because our particular brand of intelligence is optimized toward the latter.

> Biological brains have had a few billion years to optimize.

Not just that, but it's grown on a body which has been optimized for survival during a few billion years; and that body is built on cells that have evolved to survive hostile environments, and those cells are built with self-replicating molecules, evolving from complex chemical reactions in several changing environments, that competed with and displaced other less-successful self-replicating molecules that disappeared.

Each of those layers provides a degree of adaptability and self-healing that is extremely hard to replicate. And if we managed to reverse-engineer and replicate one of those layers, it would still be missing all the layers below.

Our best hope to create fully independent agents will come from re-adapting and controlling biological entities, not from tools built from the ground up with current engineering techniques.

Multi-cellular life has only been around for 600 million years or so.
Imagine if the immediate outcome of AI is not that we replace taxi drivers, dishwashers, and factory workers, but instead we displace most knowledge-worker white collar jobs, like quant and software engineer?

There's an old (and sometimes forgotten) idea in AI that perhaps things we think are simple, like vision and control (robotics), are actually incredibly complicated and took millions of years to evolve.

Whereas things we think are complicated, like playing Go or picking stocks or computer programming, are actually quite simple to learn.

This would be counter-intuitive but---as you observed, and taking my argument recursively---common sense might be much more difficult to get right than obscene pathological thinking.

Anyway, I've always thought a good startup would be to automate away Silicon Valley using AI. It's so punk rock that a lot of disillusioned smart techies would join under this banner. A collaborator of mine has already used AI to do high-level bug finding in blockchain code.

I'm not sure that people appreciate how even the highly technical white collar jobs have large social elements in them. You might be able to get the AI to write the code, but can you get it to attend the meetings?

And it's understanding what the right thing is to build that's the critical challenge in programming.

What if you no longer need meetings? Take accounting software for instance. This function will probably go from an entire team of accountants to one of the C-levels just triggering the right software at the right time as part of their normal duties.
Think about why this isn't the case already? What specific capabilities does AI introduce to accounting?
The software just isn't there yet, but we have some inkling of what might be possible in just a few years. Perhaps a closer analogy would be human computers. You would have meetings with them back in the day to set out calculation tasks, but now they are so reduced away that their existence is in itself something that has been forgotten by most. Employees just perform the duties of the human computer throughout the course of their day without even thinking that they replaced what used to be an independent function.
True but that won't save men. Already women are better suited for jobs that involve communication and empathy. I was in a hospital last week: almost a full female staff.
Humans are famously unable to adapt to changes in their environment.
> There's an old (and sometimes forgotten) idea in AI that perhaps things we think are simple, like vision and control (robotics), are actually incredibly complicated and took millions of years to evolve.

https://en.wikipedia.org/wiki/Moravec%27s_paradox

I think there's a lot of truth to this. I'm new to ML, still going through the ropes on some online courses, but already I can see that, once I get a bit of muscle memory in setting up models etc, there's a whole lot of power and efficiency to be unlocked by using simple models - specifically in CS/X and Marketing. Obviously model quality matters, so you have to have proper monitoring etc, but this stuff is low hanging fruit and should enable teams to be so much more efficient.
In a lot of these cases - maybe most, in fact - the sludge in the data pipeline makes makes the low hanging fruit hang high.

I've worked on a number of projects where it looked simple to automate from the outset and impossible in retrospect.

Agree entirely, I wouldn’t want to build a saas startup around it due to data quality, but if you have control of your pipeline it’s easier.
I dont't think it's intuition. There's a whole field of junk economics dedicated to telling us that your position in the economic class hierarchy determines the automatability of your job. In general it goes unquestioned. A vast amount of capital is also deployed based upon this assumption.

This is an example paper that, for instance, mathematically blurred the distinction between offshoring and automation:

https://talkbusiness.net/2017/07/ball-state-study-automation...

There was another paper (that i cant find right now) that basically surveyed people about how creative they thought their job was and just assumed that creativity was inversely proportional to automatability.

Ironically I think a widespread belief in this myth helped, among other things, lead to the trucker shortage. Who wants to join a profession with a high barrier to entry that they believe will be automated soon?

If software engineers end up automated away before truck drivers are, (not a completely harebrained concept given that one type of AI is doing better than expected and the other worse), it will put a hilarious spin on the "truckers should just learn to code" concept.
Those "Complicated" tasks are all built in artificially constrained systems with limited degrees of variability, which is perfect for an algorithm to learn.

Those "Simple" tasks have so much variation in them that it takes a billion+ years of evolution + the genetic pretraining to be able to perform.

So who says that picking stocks or computer programming don't require common sense?
There’s not a single missing something, there’s at least 2.

One of them is physical structure. You can get 10M somethings, sure, but how do you wire them together is probably more important than how many there are. And there’s many possible combinations.

The other missing part is that we have not figured out the high level software. A squirrel brain is a “desktop PC running windows”-level of utility. A bunch of neurons interlinked is some fashion is equivalent to a blank CPU. We know how the individual transistors work, but the BIOS, and OS are still unknown.

It’s quite possible that problem 1 and problem 2 are related, because evolution doesn’t care about making things easy to understand for us with clear delimitations.

They need well-factored accurate multimodal world models. Things like transformers and stable diffusion are promising such as the 3d video stable diffusion paper or DeepMind's multimodal transformer.

One thing that has held back progress is the way putting knowledge directly into the system has become taboo. So much so that they often fail to even guide the training towards really core aspects of the world model. Or even deliberately going about it with the assumption that everything from start to finish must be determined from the barest input data such as pixels. Then being surprised when it learns random inaccurate and overfit models that miss the underlying hierarchical structures.

> The AI field is fundamentally missing something. I don't know what it is.

I can only speculate but it certainly is for a reason that certain parts of our body are vegetativelly controlled whereas others are under the active control of our consciousness.

If you step on a nail, the first reaction comes from vegetative stimulus, later your consciousness processes that information. A squirrels neuronal network is also separated in that way. That may be a reason.

And second, AFAIK AI still doesn't 'think' in concepts, it has no notion of the 'world'.

And third: The capability of reproduction and acting accordingly may be another thing.

Automation only happens when it's cheaper than the human counterpart. In the US there are plenty of immigrants whom cost peanuts to employ. I don't expect the robotic future to come from North America.
> A squirrel has around 10 million neurons.

I don't believe this is correct. It's too low.

It's more like 400 million.

> somebody ought to be able to build something with 10 million of them

Build them, sure, but they need to be connected in the right way.