| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by pr337h4m 617 days ago
	Humans don’t use lidar, which clearly shows that a vision-only robotaxi is very much feasible.

12 comments

trhway 617 days ago

Birds have to flap wings while our planes don't have to. There is absolutely no reason to limit self-driving cars in the same way our bodies are limited.

When it comes to AI though, humans are using biological neural net much more capable than any today's AI you can cram into a car. So, even if one accepts your premise of targeting human performance as a design guideline, more sensors is still logical at this point as way to compensate for the weaker AI.

Also, if you read how Tesla does vision it is very different from, and i think inferior to, how your eyes and brain build the 3d map of the surroundings. If one is limiting oneself to only vision, the first thing would be to try to get as good as possible that 3d mapping, and the vision seems to be among the simplest and most researched brain functions, ie. easiest to reproduce. As Tesla doesn't seem to be doing it - only may be couple years ago they only started to elicit the 3d model - i think they aren't on the shortest path to success when it comes to FSD.

meow_catrix 617 days ago

Planes do ”flap their wings”, just not the ones protruding from the fuselage.

trhway 617 days ago

I think you're mistaking rotating for flapping. Rotation is one of those fundamental things differentiating our technological civilization from Nature.

meow_catrix 615 days ago

Those rotating things still produce their thrust by pushing a wing-shaped structure through air, producing a high-pressure zone on one side, and a low-pressure zone on another. That is what I was getting at. It is the same principle.

trhway 613 days ago

No, it is different. A prop or fan blade is inmovably attached to the shaft and pushed through the air the same way like the plane's wing, and the blade isn't flapped like the bird's wing.

andsoitis 616 days ago

> Rotation is one of those fundamental things differentiating our technological civilization from Nature.

Rotation is very common in nature.

Planetary rotation, inner-core rotation, spinning galaxies, dung beetle rolling, Keratinocyte migration, Rotifers, spirals, rotational symmetry, etc.

What isn’t common (but not non-existent) is using rotation for locomotion in biology.

meow_catrix 615 days ago

Many plants and trees spread rotating ”helicopter seeds”. Many vines roto-grow themselves around vertical supports. Day flowers rotate to follow the sun.

Apples and oranges fall on the ground and can roll far and wide. Walnuts too.

Partial rotation is still rotation, of course: see animal joints in walk, trot and gallop.

And then there’s the belly-up pig drunk on brewery grain rolling down the hill. That mash packs a wallop!

andsoitis 614 days ago

Yes! Which is why the idea that “ Rotation is one of those fundamental things differentiating our technological civilization from Nature” is not all that useful a statement.

K0balt 614 days ago

Huge swaths of microbes use electrostatic rotary motors driving screw type propellers, so if not say it’s that uncommon.

eesmith 616 days ago

Bacterial flagellum rotate. See also https://en.wikipedia.org/wiki/Rotating_locomotion_in_living_... .

fauigerzigerk 617 days ago

Humans don't act based on visual patterns alone though. We act based on our understanding of the world as a whole, including the intentions of other humans.

For instance, when we see a ball rolling onto the street, we know that there is probably a young person nearby who wants that ball back. We don't have to be trained on the visual patterns of what might happen next.

Of course AI can be trained on the visuals of high probability events like this. But the number of things that can potentially happen is far greater than the number of training examples we could ever produce.

Ukv 616 days ago

> the number of things that can potentially happen is far greater than the number of training examples we could ever produce

Models don't need to have been trained on every single possibility - it's possible for them to generalize and interpolate/extrapolate.

But, even knowing that it's theoretically possible to drive at human-level with only the senses humans have, it does seem like it makes it unnecessarily difficult to limit the vehicle to just that. Forces solving hard tasks at/near 100% human-level, opposed to reaching 70% then making up for the shortcoming with extra information that humans don't have.

fauigerzigerk 616 days ago

>Models don't need to have been trained on every single possibility - it's possible for them to generalize and interpolate/extrapolate.

They do have some in-distribution generalisation capabilities, but human intentions are not a generalisation of visual information.

Ukv 616 days ago

"human intentions are not a generalisation of visual information" is a bit confusing category-wise. Question would be to what extent you can predict someone's next action, like running out to retrieve a ball, given just what a human driver can sense.

Clearly that's possible to some extent, and in theory it should be possible for some system receiving the same inputs to reach human-level performance on the task, but it seems very challenging given the imposed constraints.

Also, for clarity, note that the limitations don't require the model be trained only on driver-view data. It may be that reasoning capability is better learned through text pretraining for instance.

garyfirestorm 617 days ago

Humans don’t have radar, or thermal cameras, or ultrasonic sensors, doesn’t mean planes and boats shouldn’t use those

pelorat 616 days ago

Humans eyes are an order of magnitude better than the cameras in a Tesla. Humans also have a database in their head and remembers how to behave in certain situations. FSD doesn't have any database of any kind.

svantana 617 days ago

That same argument can be used for all companies to fire all their employees. They are all human after all. Just implement all the needed features in hardware and software, done.

p_j_w 616 days ago

Humans use our brains to drive. Unless you're planning on popping an actual human brain or something that can perform equivalently into the car, you'd do well to consider more superior sensor suites.

threeseed 617 days ago

Humans continuously move their heads in three dimensions to infer depth.

Cars can't do this.

And not surprisingly the biggest problem with FSD is the accuracy of its bounding boxes.

e_y_ 616 days ago

Citation? Humans are not constantly moving their heads to the degree that chickens do, and I find it doubtful that the micro movements from our head (which our eyes have to adjust for with the vestibulo-ocular reflex so things aren't blurry, similar to image stabilization in cameras) are large enough to infer depth.

threeseed 616 days ago

I never said people are moving their heads like chickens.

But we do move our heads around pretty frequently. Enough to build mental records of what the bounding boxes are going to be for a range of objects.

e_y_ 616 days ago

We're not doing that while driving, though.

If we're talking purely about going off memory, there's no reason why machines couldn't build up a similar catalog (which could be used by every self driving AI once learned). And human ability to judge distances varies significantly between drivers.

fragmede 617 days ago

feasible? I want the thing to drive better than me, especially in the rain, fog, and the dark!

BoorishBears 617 days ago

I can't tell if this is satire, or if replicating 6 million years of evolution has legitimately become handwave material for Elon's supporters...

InDubioProRubio 617 days ago

They are afraid, times of crisis - especially planetary one, have the weaker minded and scared ones always rally around figureheads. Some guy in operetta uniforms, exclaiming "Im the captain, give me all your cash" brandishing a detached steering wheel is what the passengers want to see. Reality be a lovecraftian horror to much to bear.

consp 617 days ago

Mammalian vision and vision itself have been around a lot longer than 6 million years by at least one, likely two, orders of magnitude.

ben_w 617 days ago

I don't know if you've tried this recently, but take a photo of something on your phone and put it into an AI.

There may even be an AI built into your photo library app.

BoorishBears 616 days ago

The fact I work on self-driving cars makes me a tiny bit more of a realist than someone who thinks CLIP is proof of what AI can and can't do...

danjl 616 days ago

I'm curious. Can you elaborate on what CLIP proves about what AI can and can't do?

BoorishBears 616 days ago

My point is that it doesn't.

The fact your phone can identify an object doesn't inform you on the capabilities of self-driving car's vision stack. It's complete non-sequitur.

ben_w 616 days ago

So your job is to, in your own words, be "replicating 6 million years of evolution"?

You know how big your own team is, and that your team is itself an abstraction from the outside world. You know you get the shortcuts of being able to look at what nature does and engineer it rather than simply copy without understanding. You know your own evolutionary algorithms, assuming you're using them at all, run as fast as you can evaluate the fitness function, and that that is much faster than the same cycle with human, or even mammalian, generational gaps.

> CLIP is proof of what AI can and can't do

CLIP says nothing about what AI can't do, but it definitely says what AI can do. It's a minimum, not a maximum.

abduhl 616 days ago

Not to be rude but you're arguing with somebody that works in what I would assume is a highly mathematical space and asserting your opinion on how quickly that highly mathematical space can advance while your own profile admits that you were unable to understand "advanced calculus or group theory" and your own github indicates that you are stuck on "the hard stuff — abelian groups, curls, wedge products, Hessians and Laplacians" because you "don't understand the notation." Your opinion on the speed of advancement just doesn't seem informed?

Maybe this is an old post and your understanding has dramatically improved to the point where you're able to offer useful insight on ML/AI/self-driving?

https://benwheatley.github.io/blog/2024/03/11-12.00.16.html

ben_w 614 days ago

1. Note time stamp: https://github.com/BenWheatley/char-rnn

2. Most ML is basic calculus and basic linear algebra — to the extent that people who don't follow it, use that fact itself as a shallow argument.

3. I'm not asserting how fast it can advance, I'm asserting that the comparison with "6 million years of evolution" is a as much a shallow hand-wave as saying it's trivial, as evidenced by what we've done so far.

IshKebab 617 days ago

You mean "very much theoretically possible".

falcor84 617 days ago

s/feasible/possible/

gniv 617 days ago

Think of pile-ups. No matter how good a driver you are there are situations where you cannot prevent crashing. But lidar can.

Mawr 616 days ago

Pile ups happen because people drive:

- Over the speed limit (it's called a limit for a reason)

- Too fast for the conditions (speed limit != speed target)

- Too close to the vehicle in front of them

There are very few situations that can't be prevented by driving properly in the first place.

mlindner 617 days ago

Pray tell how a Lidar prevents crashing in this situation?

actionfromafar 616 days ago

Accurately determine distance to objects in almost no time. While a human has 1 second reaction time. There will be situations a fast reaction time alone can save.