Hacker News
by lostdog 591 days ago
It's really hard to do in practice.

Yes, you can train an ML model on rendered data, but the model tends to fixate on rendering artifacts, and its performance doesn't transfer to real-world images. Plus, it's very difficult to generate scenes with the variety and complexity of the real world, so your trained model will fail to generalize to all the distractions in natural scenes.
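
To make the artifact problem concrete, here is a toy sketch, not a real rendering pipeline. Everything in it is an assumption for illustration: each "image" is reduced to two hypothetical features, a weak genuine signal and an "artifact" cue that tracks the label almost perfectly in rendered data but is pure noise in real data. A plain logistic regression trained on the rendered set leans on the artifact, so it should look great on held-out rendered data and much worse on the "real" set.

```python
import numpy as np

def make_data(n, rendered, rng):
    """Toy stand-in for images: column 0 is a weak real signal, column 1 is the artifact cue."""
    y = rng.integers(0, 2, size=n)
    sign = 2 * y - 1
    signal = 0.5 * sign + rng.normal(0.0, 1.0, size=n)  # noisy but genuine feature
    if rendered:
        # Assumption: the renderer leaves a clean cue that tracks the label.
        artifact = 1.0 * sign + rng.normal(0.0, 0.1, size=n)
    else:
        # Assumption: in real images the cue is unrelated to the label.
        artifact = rng.normal(0.0, 1.0, size=n)
    return np.column_stack([signal, artifact]), y

def train_logreg(X, y, lr=0.1, steps=2000):
    """Logistic regression fitted with plain batch gradient descent."""
    w = np.zeros(X.shape[1])
    b = 0.0
    for _ in range(steps):
        p = 1.0 / (1.0 + np.exp(-(X @ w + b)))
        w -= lr * (X.T @ (p - y)) / len(y)
        b -= lr * np.mean(p - y)
    return w, b

def accuracy(w, b, X, y):
    return float(np.mean(((X @ w + b) > 0) == (y == 1)))

rng = np.random.default_rng(0)
X_train, y_train = make_data(4000, rendered=True, rng=rng)
X_rendered, y_rendered = make_data(2000, rendered=True, rng=rng)
X_real, y_real = make_data(2000, rendered=False, rng=rng)

w, b = train_logreg(X_train, y_train)
rendered_acc = accuracy(w, b, X_rendered, y_rendered)
real_acc = accuracy(w, b, X_real, y_real)

print("weights [signal, artifact]:", w)
print("held-out rendered accuracy:", rendered_acc)
print("real accuracy:", real_acc)
```

The held-out rendered score is the misleading part: it tells you nothing about transfer, because the validation data carries the same artifact as the training data.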

Yes, there are techniques for all these problems, but none of them are good, reliable, or easy to get right.