Their decision to rely on vision got them billions of miles of training data from every corner of the USA. It did not get them point clouds, but monocular depth estimation works extremely well these days, so would it really have been that valuable? It looks to me like they made a shrewd bet and won big.
Seems like a mistake not to be a sensor maximalist. Why not try and get as many data streams onboard as possible. Seems like the generalized robot OS would be better if it was a data fusion platform.
That’s a good point. I coming from the perspective that waymo is more of a “dead reckoning” approach compared to Tesla long(and maybe never ending) road.