Hacker News new | ask | show | jobs
by mensetmanusman 2139 days ago
Good analogy about discovery of Pharma molecules.

It’s really fun to think about the fact that Tesla has more than enough data to unlock autonomous vehicles, but all that is missing is the correct AI architecture to get it working...

Who will figure out how to code that? Will it be a breakthrough, or can sub-optimal architectures eventually reach equilibrium with 10x or 100x the amount of time/data processing.

3 comments

> Tesla has more than enough data to unlock autonomous vehicles

Many people in the automotive industry, myself included, disagree with this statement pretty strongly. Driving data quantity is not equivalent to quality and they are severely lacking in advanced sensor data.

I'm likely biased because I spend some of my time doing perception research, but I find the "advanced sensors are necessary" argument so odd. We have clear evidence from humans that you don't need them. I expect we'll be doing this sort of thing [0] in toy dynamic scenes from monocular vision in ~1year, and in real-time on city scenes in ~2. Perception-wise, what more do you need?

Planning and control seem much harder, but that's not a sensing problem.

[0] https://nerf-w.github.io/

So is the claim by Elon Musk that current iterations of Tesla vehicles have all of the sensors and compute power needed to be fully autonomous (Level 4+ I guess?) in the future, via software updates only, a specious one?
It's hard to be absolutist on the response to that: anything is possible, and humans can drive without LIDAR.

But at the moment it seems a strange position to take: we know LIDAR data is useful in many circumstances, and we know it can solve a number of the hard parts of computer vision.

Musk also said he was taking Tesla private at $420 a share, funding secured.

He says a lot of things.

I don’t know, the existence proof is that it takes a 16 year old a few days of driving before they get it well enough...
I'm sorry, but is that true, that Tesla has enough data to unlock autonomous vehicles? My experience is that until you get an ML model to do X, you never know if you have enough data to train it to do X. Or is that just your opinion, that they don't need more data?
Perhaps. It seems it’s still an open question whether AI is just about memorizing your data, or can it actually make reliable decisions during previously unseen scenarios.

Have we already observed, or collected, all that is possible in the “driving” world?

> It seems it’s still an open question whether AI is just about memorizing your data,

No - it's not at all, and this is a well understood problem in building machine learning systems. There are some cases where this occurs but usually this is just overfitting.

Good AI systems generalize well on unseen data.

To the downvoters, I give you AlphaZero.

Not only is every game of Go it plays and wins brand new (so no memorisation), the same system learnt to play Chess without knowing the rules, and plays in a "style .. unlike any traditional chess engine"

https://deepmind.com/blog/article/alphazero-shedding-new-lig...

I've yet to see an "AI" that is not just memorizing data.
Just memorising data is simple, use a file on disk. The hard part is recognising data when it is slightly different than what the model 'memorised', deciding which of the millions of things it learned best fits the answer we desire.
Then you haven't really looked.

Most credible machine learning systems work well on unseen data, which by definition isn't memorizing.

> Most credible machine learning systems work well on unseen data, which by definition isn't memorizing.

Sorry, but no. ML models don't generalise well outside the training data, but they can interpolate inside. This question becomes very interesting in the case of GPT-3 which has had a huge corpus of text to train on, so it's probably seen 'everything'. It's still memorising for GPT-3 but also learning to manipulate data, like software algorithms.

ML models don't generalise well outside the training data, but they can interpolate inside.

I'm unsure if you just misstated this or don't know, but this is wrong.

ML models don't generalise well on data outside the distribution of their training data. But that's an entirely different thing, and doesn't mean at all they are memorising data.

Imagine something training on the US unemployment rate until 2020 being hit with the COVID rate. It wouldn't know what to do, but that doesn't mean it wouldn't work fine on a rate of 5.342% even if it had never seen that rate before.

This is a simplified example, but applies to everything.

GPT-3 generation of text does pull from memorised training data. There's a lot of stuff going on there, and amongst other things there has never really been a system that does textual generation well. It's also hugely overparameterised, so lots of potential for overfitting. I don't think it's a good example of a "good" AI system - it's very interesting, full of potential, but there are lots of issues.

Can you link an example you find to generalize particularly well?
Sure. Apple's Face Unlock.

It generalises to almost every face, and the ones it doesn't its failure mode is safe.

Or something like word embeddings. Works incredibly well, and most "failure" modes are around things like bias, where the behavior reflects the real world.

Or something like AlphaZero. Not only is every new game of Go it plays brand new, it learnt to play Chess without knowing the rules. That just isn't memorization.

https://deepmind.com/blog/article/alphazero-shedding-new-lig...

Word embeddings suck. Take a look at the graphs under " 2. Linear substructures" in

https://nlp.stanford.edu/projects/glove/

and note that they all involve looking at a small number of points. It is easy to reproduce plots like that but if you try to increase the number of points the result breaks down completely.

It is a curve fitting problem: for a small enough set of points compared to the number of dimensions, you can find a matrix that projects a set of random points to an exactly specified set of points in the plane. If you relax the problem to something like "put colors on the left side, put smells on the right side" you will get better than random performance from that kind of model, but not that much better than random.

Word embeddings are a strategy that approaches an asymptote. Systems that are destined to low performance will perform better if you use a word embedding, but they throw away information up front that makes high performance impossible.

https://arxiv.org/pdf/1805.12177.pdf

I only read the abstract.

But then again, how does something like the following work?

https://twitter.com/GoogleAI/status/1293970520753369088

Any idea how it could be fooled?

What do you mean by "fooled"?

I'm very familiar with the BlazeFace and FaceMesh models (which are related to this in Google's MediaPipe framework).

They have weaknesses - they aren't designed for running upside down for example, so if they get data that is that way oriented they will tend to fail.

They aren't designed for "life" detection, so you can show printed pictures of a face and it will detected it.

But they give you confidence scores etc, so if you give it something like a caricature of a face it will return reasonable confidence numbers indicating it isn't as sure of its predictions.

Most self-driving companies use simulations to see how the model performs in unseen scenarios.
I'm sure that's tremendously helpful but isn't exactly an antidote, is it? You can't really simulate an unforeseen situation. If you can simulate it accurately - then you've foreseen it, and are thus able to account for all relevant variables. If you are randomly generating scenarios, something like fuzz testing or property testing, then I am sure you will discover bugs before they hit production, but you can't be sure you're simulating it accurately.

For example, maybe when your car is T-boned while climbing a steep hill, the suspension behaves in a way you didn't expect and which isn't replicated by your simulation. Or maybe you're sent a batch of decals with bad adhesive, and in the hot sun the begin to slip down the windshield until they end up obscuring or otherwise interfering with a sensor.

The only simulation that can account for every variable, regardless of whether you've anticipated it, is reality.