|
|
|
|
|
by glalonde
2856 days ago
|
|
What about pose estimation? e.g. Given a well defined coordinate system, like the origin is the nose on a face, determine the pose of the face. Is this still best done with classic optimization formulations like ransac/ICP and a supplied model, or have these been bested by learned models somehow? |
|
With that said, I think there's still a ton of merit in classical geometric approaches like ICP — there's a real, geometric basis to why they work. Convolutional networks can demonstrate some pretty amazing results, but they're still mostly "black boxes" to us, and a consequence of this is that it's hard to understand why they work and predict when they'll fail. This blog post (by the PoseNet author, actually) articulates the viewpoint well: https://alexgkendall.com/computer_vision/have_we_forgotten_a.... One recent research direction that I personally find really fascinating is designing deep learning architectures around real geometric properties, e.g. as in Skydio's deep stereo work: https://arxiv.org/pdf/1703.04309.pdf