by buildbot
878 days ago
Disagree there. Humans have massive compute, dual optics, and amazing filters. Computer vision has 1-2 of those three, and I don't think we are near an AGI for self-driving yet. Driving is, IMO, an AGI-level task. Does your dataset have a crocodile in it? Does your monocular depth model get fooled by a billboard that's just a photo?

This is actually a pretty clever example. I tried a few billboards on the online demo. Since these are regression models, they tend to output the mean of the possible outputs, so sometimes the model seems unsure whether to output something completely flat or something with actual depth, and it ends up producing something in between.
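
To illustrate the "mean of the possible outputs" point, here is a toy sketch. It is not the actual depth model; the depths (10 m for the flat billboard, 50 m for the depicted scene) and the 50/50 split are made-up numbers, and I'm assuming a plain squared-error loss. It just searches for the single prediction that minimizes the error when the target is ambiguous between the two.

```python
import numpy as np

# Hypothetical ambiguity for one billboard pixel: half the plausible labels
# say "flat surface at 10 m", half say "depicted scene at 50 m".
targets = np.array([10.0] * 50 + [50.0] * 50)

# Try a grid of constant predictions and measure mean squared error for each.
candidates = np.linspace(0.0, 60.0, 601)
mse = ((candidates[:, None] - targets[None, :]) ** 2).mean(axis=1)

# The prediction with the lowest squared error sits between the two modes.
best = candidates[np.argmin(mse)]

print("best constant prediction:", best)
print("mean of targets:", targets.mean())
```

The best prediction lands at the mean of the two depths, which matches neither interpretation. That would be consistent with the in-between output I saw, though the real model's loss may differ.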