|
|
|
|
|
by tsimionescu
924 days ago
|
|
We don't know how it works, and the parts that we do know are very hard to implement. In particular, our vision is very very tied to our model of the world, and is clearly an evolutionarily derived trait, not something we learn from scratch as children. We are basically born with a model that can augment the raw sensory input with basic physics (object permanence, but also we have an intuition for things like "heavy = large momentum even at slow speeds", and for "what goes up will come down", some basic optics of how objects cast shadows, and others). In addition, we learn to recognize objects and we know their actual sizes from past experience, so we typically estimate distance this way far better than relying on parallax. "Small image of car" is automatically "car far away", even if seen with one eye, because we know what size a car is. In fact, much of our brilliant vision starts going awry immediately in an artifical setting with misleading sized objects. |
|
What you describe is largely the classical "cognitivist" model, which is more or less dead in the contemporary tought. We don't even have a very good account of depth estimation, apart from "bunch of things seem to affect it".