It's an anti-fragile approach, a term you'll recognize if you know of Nassim Taleb and his work. I think it will win in the long run, because not requiring HD Maps or specialized sensors is an advantage, and even if it takes more resources to make it work initially, it will save billions in the future, assuming, of course, it ever ships.
Obviously, pure vision is a viable system, as it's what we as humans use. The question remains as to whether or not it will be comparable to more precise LiDAR based systems in the near future.
This is not a good argument overall for several reasons. First, we should aim to greatly exceed human performance on safety. Second, whatever work goes into making the algorithms work well using vision can just as easily still inform a system that fuses that data with LIDAR for enhanced situational awareness and safety.
Obviously, pure vision is a viable system, as it's what we as humans use. The question remains as to whether or not it will be comparable to more precise LiDAR based systems in the near future.