Hacker News new | ask | show | jobs
by fshbbdssbbgdd 617 days ago
The idea that Tesla would win the robotaxi race by not needing LiDAR died sometime between when LiDAR cost $100k and when it cost $1k. Now it’s just Elon being intransigent.
2 comments

can I get a lidar for 1k like the cars use?
There is no reliable FSD implementation on any car right now so it's kind of an irrelevant question.

The more relevant one is what will happen first. Tesla figuring out how to make vision only work on their existing hardware. Or the price of LiDAR coming down.

I’ve ridden Waymo in SF and it has gone great. The cost was cheaper than the cheapest Lyft/Uber, but in a much nicer vehicle. I felt 100% safe the whole time, which is better than I can say about humans who get paid more if they drive faster. My only complaint is cases like where it “wasted” a few minutes because it didn’t want to do an illegal U-turn during the pickup (any human driver would have done it).

The word “reliable” without any units attached isn’t well-defined, so I can’t say whether Waymo meets that bar, but it’s a good customer experience.

FYI Waymo is typically ~30% more expensive than Lyft/Uber in SF.
On-chip lidars are coming to automotive sector currently
You're going to have to be more specific with your necessary specs, there are $100 360° hobbyist LIDAR sensors on Amazon.
No there aren't. Those use triangulation. LIDAR is time-of-flight. They also only scan a single rotating point which is only sufficient for simple robots like vacuum cleaners.
Intel's L515 lidar from 2020 was <$300, uses MEMS ToF instead of rotating for very high speed scanning. 730p@30.

Good indoor range but not really useful outdoors at any range. Scaling to higher power is indeed a challenge, but that Intel delivered so so much in 2020 for such a small price is awesome, shows potential.

Damn that looks amazing! Such a shame they abandoned RealSense.

Still, $300 is not $100 and presumably they were selling at big loss, otherwise they wouldn't have shuttered RealSense.

Apple has ToF lidar for face recognition for years now. It's a matter of spec.

Similar style single chip lidar for automotive is in engineering sampling phase now [1]. Price remains to be seen but anything sub 1k would be a no-brainer to add to a robo-taxi.

Oh, everyone in the industry thinks Tesla is .. how to put it nicely .. is irrelevant for the future because of their CEOs stance on sensors. Camera will never be enough.

https://scantinel.com/2023/07/03/scantinel-photonics-launche...

If you buy en masse maybe. We buy such devices on or few at a time for industrial use cases, and those will cost you 10k€ for the big ones, and maybe less for the smaller ones. lots of development happening in the space tho.
Humans don’t use lidar, which clearly shows that a vision-only robotaxi is very much feasible.
Birds have to flap wings while our planes don't have to. There is absolutely no reason to limit self-driving cars in the same way our bodies are limited.

When it comes to AI though, humans are using biological neural net much more capable than any today's AI you can cram into a car. So, even if one accepts your premise of targeting human performance as a design guideline, more sensors is still logical at this point as way to compensate for the weaker AI.

Also, if you read how Tesla does vision it is very different from, and i think inferior to, how your eyes and brain build the 3d map of the surroundings. If one is limiting oneself to only vision, the first thing would be to try to get as good as possible that 3d mapping, and the vision seems to be among the simplest and most researched brain functions, ie. easiest to reproduce. As Tesla doesn't seem to be doing it - only may be couple years ago they only started to elicit the 3d model - i think they aren't on the shortest path to success when it comes to FSD.

Planes do ”flap their wings”, just not the ones protruding from the fuselage.
I think you're mistaking rotating for flapping. Rotation is one of those fundamental things differentiating our technological civilization from Nature.
Those rotating things still produce their thrust by pushing a wing-shaped structure through air, producing a high-pressure zone on one side, and a low-pressure zone on another. That is what I was getting at. It is the same principle.
No, it is different. A prop or fan blade is inmovably attached to the shaft and pushed through the air the same way like the plane's wing, and the blade isn't flapped like the bird's wing.
> Rotation is one of those fundamental things differentiating our technological civilization from Nature.

Rotation is very common in nature.

Planetary rotation, inner-core rotation, spinning galaxies, dung beetle rolling, Keratinocyte migration, Rotifers, spirals, rotational symmetry, etc.

What isn’t common (but not non-existent) is using rotation for locomotion in biology.

Many plants and trees spread rotating ”helicopter seeds”. Many vines roto-grow themselves around vertical supports. Day flowers rotate to follow the sun.

Apples and oranges fall on the ground and can roll far and wide. Walnuts too.

Partial rotation is still rotation, of course: see animal joints in walk, trot and gallop.

And then there’s the belly-up pig drunk on brewery grain rolling down the hill. That mash packs a wallop!

Huge swaths of microbes use electrostatic rotary motors driving screw type propellers, so if not say it’s that uncommon.
Humans don't act based on visual patterns alone though. We act based on our understanding of the world as a whole, including the intentions of other humans.

For instance, when we see a ball rolling onto the street, we know that there is probably a young person nearby who wants that ball back. We don't have to be trained on the visual patterns of what might happen next.

Of course AI can be trained on the visuals of high probability events like this. But the number of things that can potentially happen is far greater than the number of training examples we could ever produce.

> the number of things that can potentially happen is far greater than the number of training examples we could ever produce

Models don't need to have been trained on every single possibility - it's possible for them to generalize and interpolate/extrapolate.

But, even knowing that it's theoretically possible to drive at human-level with only the senses humans have, it does seem like it makes it unnecessarily difficult to limit the vehicle to just that. Forces solving hard tasks at/near 100% human-level, opposed to reaching 70% then making up for the shortcoming with extra information that humans don't have.

>Models don't need to have been trained on every single possibility - it's possible for them to generalize and interpolate/extrapolate.

They do have some in-distribution generalisation capabilities, but human intentions are not a generalisation of visual information.

"human intentions are not a generalisation of visual information" is a bit confusing category-wise. Question would be to what extent you can predict someone's next action, like running out to retrieve a ball, given just what a human driver can sense.

Clearly that's possible to some extent, and in theory it should be possible for some system receiving the same inputs to reach human-level performance on the task, but it seems very challenging given the imposed constraints.

Also, for clarity, note that the limitations don't require the model be trained only on driver-view data. It may be that reasoning capability is better learned through text pretraining for instance.

Humans don’t have radar, or thermal cameras, or ultrasonic sensors, doesn’t mean planes and boats shouldn’t use those
Humans eyes are an order of magnitude better than the cameras in a Tesla. Humans also have a database in their head and remembers how to behave in certain situations. FSD doesn't have any database of any kind.
That same argument can be used for all companies to fire all their employees. They are all human after all. Just implement all the needed features in hardware and software, done.
Humans use our brains to drive. Unless you're planning on popping an actual human brain or something that can perform equivalently into the car, you'd do well to consider more superior sensor suites.
Humans continuously move their heads in three dimensions to infer depth.

Cars can't do this.

And not surprisingly the biggest problem with FSD is the accuracy of its bounding boxes.

Citation? Humans are not constantly moving their heads to the degree that chickens do, and I find it doubtful that the micro movements from our head (which our eyes have to adjust for with the vestibulo-ocular reflex so things aren't blurry, similar to image stabilization in cameras) are large enough to infer depth.
I never said people are moving their heads like chickens.

But we do move our heads around pretty frequently. Enough to build mental records of what the bounding boxes are going to be for a range of objects.

We're not doing that while driving, though.

If we're talking purely about going off memory, there's no reason why machines couldn't build up a similar catalog (which could be used by every self driving AI once learned). And human ability to judge distances varies significantly between drivers.

feasible? I want the thing to drive better than me, especially in the rain, fog, and the dark!
I can't tell if this is satire, or if replicating 6 million years of evolution has legitimately become handwave material for Elon's supporters...
They are afraid, times of crisis - especially planetary one, have the weaker minded and scared ones always rally around figureheads. Some guy in operetta uniforms, exclaiming "Im the captain, give me all your cash" brandishing a detached steering wheel is what the passengers want to see. Reality be a lovecraftian horror to much to bear.
Mammalian vision and vision itself have been around a lot longer than 6 million years by at least one, likely two, orders of magnitude.
I don't know if you've tried this recently, but take a photo of something on your phone and put it into an AI.

There may even be an AI built into your photo library app.

The fact I work on self-driving cars makes me a tiny bit more of a realist than someone who thinks CLIP is proof of what AI can and can't do...
I'm curious. Can you elaborate on what CLIP proves about what AI can and can't do?
My point is that it doesn't.

The fact your phone can identify an object doesn't inform you on the capabilities of self-driving car's vision stack. It's complete non-sequitur.

So your job is to, in your own words, be "replicating 6 million years of evolution"?

You know how big your own team is, and that your team is itself an abstraction from the outside world. You know you get the shortcuts of being able to look at what nature does and engineer it rather than simply copy without understanding. You know your own evolutionary algorithms, assuming you're using them at all, run as fast as you can evaluate the fitness function, and that that is much faster than the same cycle with human, or even mammalian, generational gaps.

> CLIP is proof of what AI can and can't do

CLIP says nothing about what AI can't do, but it definitely says what AI can do. It's a minimum, not a maximum.

Not to be rude but you're arguing with somebody that works in what I would assume is a highly mathematical space and asserting your opinion on how quickly that highly mathematical space can advance while your own profile admits that you were unable to understand "advanced calculus or group theory" and your own github indicates that you are stuck on "the hard stuff — abelian groups, curls, wedge products, Hessians and Laplacians" because you "don't understand the notation." Your opinion on the speed of advancement just doesn't seem informed?

Maybe this is an old post and your understanding has dramatically improved to the point where you're able to offer useful insight on ML/AI/self-driving?

https://benwheatley.github.io/blog/2024/03/11-12.00.16.html

You mean "very much theoretically possible".
s/feasible/possible/
Think of pile-ups. No matter how good a driver you are there are situations where you cannot prevent crashing. But lidar can.
Pile ups happen because people drive:

- Over the speed limit (it's called a limit for a reason)

- Too fast for the conditions (speed limit != speed target)

- Too close to the vehicle in front of them

There are very few situations that can't be prevented by driving properly in the first place.

Pray tell how a Lidar prevents crashing in this situation?
Accurately determine distance to objects in almost no time. While a human has 1 second reaction time. There will be situations a fast reaction time alone can save.