Hacker News
by nitrogen 2990 days ago
Structure from motion is an existing technique. What does ML contribute in this case? Joint positioning, maybe?

https://en.m.wikipedia.org/wiki/Structure_from_motion

2 comments

99%¹ of computer vision problems are 80% solved. The problem is, you need a 95+% solution to be practically useful.

Binocular stereo vision has only just approached general applicability, and SfM is mostly used either in very constrained environments (traffic analysis) or with large computational resources plus manual correction (offline 3D mapping from aerial data).

¹ Numbers are illustrative only, based on experience in scientific and industrial CV.

SfM does not automatically provide joint locations. Also, a casual 360 video around a subject does not provide enough data to produce a full body mesh.
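
To illustrate the first point, here is a minimal sketch of the triangulation step at the core of SfM, using plain NumPy. The two camera matrices and the test point are made-up values for illustration, and the cameras are assumed known here (a real SfM pipeline would have to estimate them as well). Note what comes out: a bare 3D coordinate, with nothing saying whether it is an elbow, a knee, or a point on the background.

```python
import numpy as np

def triangulate(P1, P2, x1, x2):
    """Linear (DLT) triangulation of one 3D point from two views.

    P1, P2: 3x4 camera projection matrices.
    x1, x2: 2D image coordinates of the same point in each view.
    """
    # Each view contributes two linear constraints on the homogeneous point X.
    A = np.stack([
        x1[0] * P1[2] - P1[0],
        x1[1] * P1[2] - P1[1],
        x2[0] * P2[2] - P2[0],
        x2[1] * P2[2] - P2[1],
    ])
    # The solution is the right singular vector with the smallest singular value.
    _, _, Vt = np.linalg.svd(A)
    X = Vt[-1]
    return X[:3] / X[3]

def project(P, X):
    """Project a 3D point into an image using camera matrix P."""
    x = P @ np.append(X, 1.0)
    return x[:2] / x[2]

# Hypothetical setup: identity intrinsics, second camera shifted 1 unit along x.
P1 = np.hstack([np.eye(3), np.zeros((3, 1))])
P2 = np.hstack([np.eye(3), np.array([[-1.0], [0.0], [0.0]])])

X_true = np.array([0.5, 0.2, 4.0])  # made-up 3D point
X_est = triangulate(P1, P2, project(P1, X_true), project(P2, X_true))

# X_est is just an unlabeled 3D coordinate; no joint semantics attached.
print(X_est)
```

Getting from a cloud of such points to "this one is the left shoulder" needs some additional model on top, which is presumably where the learned component comes in.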