Hacker News new | ask | show | jobs
by meehai 608 days ago
one answer is due to the fact that humans also do this with just 2 pretty bad cameras and a lot of offloading to the cortex.

It also simplifies the stack a lot to have a single set of sensors, so the software becomes mostly: getting good training data (iterative loops from failing production cases) and an efficient training algorithm.

This scales to more than just AD and also can leverage new breakthroughs from academia

9 comments

> one answer is due to the fact that humans also do this with just 2 pretty bad cameras and a lot of offloading to the cortex.

No, humans do significant sensor fusion.

There's binaural audio: useful to detect and have a rough position of emergency services and/or high speeds cars(it's true that its a relatively unencumbered channel, but that makes it all the more valuables for emergencies)

And there's a working if imperfect IMU (performance can be altered if the power supply is set to an alternate mode) to sense all kinds of acceleration: for fine course correction on acceleration and bearing, for getting the road condition and adjust the driving profile accordingly, etc..

Humans don't just have two eyes though (and our eyes are pretty damn good as far as organic light sensors go), we have 3 mirrors giving all-around sight, a whole body full of nerves providing feedback, and excellent 3D reckoning to keep track of other vehicles.
Our eyes (+ brain visual processing) are way better than the vast majority of cameras you can buy, judged on things we care about in driving. Typically only high-end film cameras approach the dynamic range of our eyes, for example. The only major thing that even cheap cameras easily beat us at is zoom, but that is mostly irrelevant for this use case.
Humans also have only legs instead of wheels. The whole point of having cars (or any machine) is that they're not limited to my biology build.
With this logic you should also insist on legs instead of wheels.
The human eyes are also mounted on a neck that can turn it around in all directions. Kinda useful for parking
Humans have millions of years of evolution in vision processing, and yet we still regularly have 100+ deaths a day in the US, plus many more injuries and fender benders.

I don't doubt that it's possible with machine vision alone, but it makes the challenge substantially harder.

Another way of saying this is that we do this with a system comprised of human eyes and a human brain. We’re very good at making machine eyes, but the brain part is proving extremely difficult for us to reproduce with machines.
I mean, human eyes are in some very important ways better. Not least, they're self cleaning and can adjust their angle for a better view.
And they have way better dynamic range.
Yup - if there was a camera today that could consistently (and without any fiddling) reproduce things the way my eyes see them (especially in high dynamic range or low light situations), I would buy it immediately!