Hacker News new | ask | show | jobs
by kgraves 720 days ago
Can this be used for image to 3D generation? What is the SOTA in this area these days?
2 comments

I think what they did here was go text prompt -> generate multiple 2d views -> reconstruction network to go multiple 2d images to 3d representation -> mesh extraction from 3d representation.

That's a long way of saying, no, I don't think that this introduces a component that specifically goes 2d -> 3d from a single 2d image.

The paper suggests Rodin Gen-1 [0] is capable of image-to-shape generation.

[0] https://hyperhuman.deemos.com/rodin