by jknoepfler 1115 days ago
Forgive me if this seems like a knee-jerk response, but literally zero human drivers use "vision alone" to drive. Humans drive with a spatio-temporal model of the world; visual, auditory, and haptic feedback; logical/symbolic rules about driving norms; emulative models of other drivers/agents in the driving environment; ethical judgments about what it is OK or not OK to collide with... and so on and so forth.

Reducing that to "vision" doesn't even make superficial sense.

1 comment

I've always wondered, and don't remember seeing it specified one way or another: do self-driving AIs use any sort of temporal modelling of the environment?

A model that can predict the behaviour of the objects the visual part detects seems orders of magnitude better than one that just does visual detection from scratch. It seems like a huge wasted opportunity to have to model the world starting from zero for every frame received from the cameras.

Objects that pop in and out of the field of vision due to occlusion or other reasons seem to trip up Teslas (in at least some of the reported incidents), but even so it's hard to believe that they didn't implement such an obvious improvement.
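
To make concrete what I mean by temporal modelling, here is a toy sketch in Python. It is not how Tesla or any real system does it. The `Track` class, its fields, and the `gain` parameter are all made up for illustration, and the filter is a simple alpha-beta style constant-velocity tracker. The point is only that a track carries position and velocity between frames, so when a detection goes missing (say, due to occlusion) it can coast on its prediction instead of forgetting the object.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass
class Track:
    """Hypothetical constant-velocity track for one detected object."""
    x: float
    y: float
    vx: float = 0.0
    vy: float = 0.0
    missed: int = 0  # consecutive frames with no matching detection

    def predict(self, dt: float) -> Tuple[float, float]:
        """Where the object should be after dt seconds, assuming constant velocity."""
        return (self.x + self.vx * dt, self.y + self.vy * dt)

    def update(self, detection: Optional[Tuple[float, float]],
               dt: float, gain: float = 0.5) -> None:
        """Advance one frame; `detection` is None when the object is not seen."""
        px, py = self.predict(dt)
        if detection is None:
            # Occluded or missed: keep the object alive by coasting on the prediction.
            self.x, self.y = px, py
            self.missed += 1
            return
        # Blend the prediction with the measurement (alpha-beta style correction).
        rx, ry = detection[0] - px, detection[1] - py
        self.x = px + gain * rx
        self.y = py + gain * ry
        self.vx += gain * rx / dt
        self.vy += gain * ry / dt
        self.missed = 0
```

For example, a track moving at 1 unit per frame that gets two frames with `detection=None` still advances 2 units, so when the object reappears it can be matched to the same track rather than treated as brand new. A real tracker would also need data association between detections and tracks, and a rule for dropping tracks once `missed` gets too large.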