Models need to generalize over what they've been shown though. Most people won't learn to drive near an airplane or a wheelchair either, but would have no problem avoiding those.
Humans generate general 3d representation of the world. You don't know what this object is, yet you see it's borders. Tesla seems to generate understanding only of the objects that it has already seen.