Hacker News new | ask | show | jobs
by bayindirh 273 days ago
Tesla killed its brand reputation thrice.

- First they went "camera only", alienating people who knows the tech.

- Then they mocked car industry for so long. It was a necessary poke at first, but they didn't get prepared, and the elephant proved that it can run.

- Then Elon's Trump affair and all the shebang happened.

The broken FSD promises, using non-auto rated parts (and related failures), being negligent of their own errors and acting like they are deaf to the criticism is the cement between the layers.

2 comments

Tesla‘s “vision only” with phantom braking suicide experiments reached court last year in Germany and for the first time the court proved they exist for real and the cars are dangerous. This will be interesting to watch. I tried often free autopilot in model Y and it hits hard the brakes on empty road every other time. Afterwards I stopped using it completely. The car is nice, but without working assistance systems. Lane keeping also does not work reliably. Model Y is nice electric car for people without much requirements like me - it’s spacious and electric range is acceptable.

About the process in court in German: https://teslaanwalt.de/autopilot-als-sicherheitsrisiko/

They had camera-only tech employing multiple 4k cameras running at over 2000fps. Not your grandma's 480p/25fps webcam many car manufacturers use as parking cameras. 2000fps gives you enormous safety margin even in case of individual frame misdetection. The long-tail issues they hit are present on LiDAR vehicles as well but LiDAR is much slower, more difficult to process and sensor fusion adds its own errors.
> The long-tail issues they hit are present on LiDAR vehicles as well but LiDAR is much slower, more difficult to process and sensor fusion adds its own errors.

The long tail is long no matter what. Which is why the most robust solutions deploy sensors with orthogonal sensing modalities that can compliment one another. By relying on only one sensor type, Tesla has made it hard for their system overly brittle, which has resulted in avoidable deaths and destruction.

> LiDAR is much slower, more difficult to process

LiDAR in my experience is much easier to process, as the sensor stream is just an array of distances. Camera in my experience is much harder to process, as the sensor stream is an array of RGB values from which you have to infer distances. So by what metric are you alleging LiDAR is more difficult to process?

> sensor fusion adds its own errors.

You'll have to do a degree of sensor fusion across all the camera sensors anyway, so going camera-only don't absolve you of having to fuse sensor streams and come up with a belief. Sensor fusion in general tends to decrease overall system error as more sensors are added.

Googling suggests the Tesla cameras do 36 fps https://x.com/nextbigfuture/status/1895692497898390021
as expected too heavy of a data stream to be true. resolution is lower than 4K while frame rates aren’t even in the ballpark of 2000 fps.

https://www.blogordie.com/2023/09/hw4-tesla-new-self-driving...

I have this first-hand from the FSD team at Tesla from 5 years ago. Who knows where they are nowadays. You can believe whatever you like.
There's no way the FSD team overstated its capabilities, right?
Everything is possible. They also might have used some creative metrics giving 2000+ fps. I don't know. Or they might have found some neat trick nobody thought about before.
That would be something like 371Gbps (some assumptions) raw data to process, per camera. I would assume a lot of shortcuts to get that down, but still an unreasonably huge scope to process in "real time" in a car.
Assuming of course that the relevant computer vision components can run at 2000 fps as well.. I highly doubt it.
Running at 2000FPS in low light (and getting meaningful data at that sensor size) is also impossible to begin with. Even if you can do constant 60, you're in good shape.

2000 can be good for doing multiexposure and maybe detecting fine movement, but assuming that everything running 2000FPS (and processing 16000 frames/sec) is not a simple thing, esp, if you're running in an uncontrolled and chaotic environment.

I was about to comment the same, 2k FPS means a maximum shutter speed of 1/2000, you need a lot of light to capture an image this quickly, in low light conditions it's simply impossible to capture enough light even if using very high end optics and sensors.
I don't know the specifics, maybe they are timing individual cameras in a way they achieve 2000fps with a crisp image in each camera and merging them together. Or maybe they are using some MIT tech that was able to capture super low light conditions.
Being able to capture in super low light conditions is dependent on two things. 1. Your sensor's noise floor, 2. The number of photons you can get per unit time.

First one is dependent on the manufacturing process, and the second one is dependent on your sensor size.

Currently, the leading sensor manufacturers (namely Sony Semiconductor and Canon) are doing very low noise sensors. However, to get both these low noise levels and convincing images needs full frame sensors, at least. APS-C can somewhat close, but it can't be there (because physics).

Even in that case, you can't do 2000FPS and get meaningful images from every one of them.

There's no way that a Tesla car cam sports full frame or APS-C sensors.

So, it's physics.

Or, just maybe, you are completely wrong and misheard or were lied to.
Why 2k FPS? I'm not being facetious; human eye sees, apparently, at around 25fps, which is why this is what TVs and cinemas used to use. At that rate, and 144kph, say, the car moves 1.4m between frames.

Fine, so maybe you think this is too much. But 10x this still gives you 14cm between frames, at what is already speeding in most jurisdictions I know of.

2000 FPS seems to my untrained eye like a problem, not a feature.

> human eye sees, apparently, at around 25fps

What do you mean by "sees"? I'll bet you that you can't walk around wearing a VR headset running at 25 FPS for more than 30 seconds without violently emptying your stomach. Trying to watch a movie on a display that doesn't exhibit motion blur also makes me motion sick.

Human brain doesn't see in terms of frames at all. There's a limit where an increase in FPS likely becomes imperceptible to most people but that limit is at least 10 times higher (from personal experience), likely more.

Humans don't see around 25fps, it depends but the maximum for a trained person or competitive player is 300hz
Because you are processing a sequence of a fixed length in deep learning models and the more frames you have, the more accurate your FSD output is. Driving 1.4m between the frames with single-frame accuracy of 80% is quite risky and input correction quite discrete; 14cm is still risky for a proper trajectory planning. Now make it in millimeters and suddenly your trajectory is nearly perfect with only little noise.
5mp 36fps. 1mp on older Teslas.
2000 FPS is impressive.

Not detecting overturned semis, road debris, and swerving to road dividers is even more impressive with that tech.

Where a relatively simple radar can prevent without running a slow-motion camera rig and a wannabe supercomputing cluster on the car.

To be frank, I'm not against 2000 FPS cameras, but I can't come into terms with not adding a simple radar to detect something unknown is dangerously close and the land missile needs to stop.