Hacker News new | ask | show | jobs
by SrslyJosh 1442 days ago
> the highest accuracy 56.8% AP among all known real-time object detectors with 30 FPS or higher

Yikes. It's not clear to me if that's the upper limit on accuracy or a limit imposed by requiring that it run at 30 FPS, but still...yikes.

1 comments

It's clearly the latter and I don't see why it would be "yikes". Real time detectors are useless if "real time" means 1fps.
What good is speed if the accuracy isn't significantly better than a coin flip?

From the paper:

> For example, multi-object track- ing [94, 93], autonomous driving [40, 18], robotics [35, 58], medical image analysis [34, 46], etc.

LOL, these are all great use cases for a model with < 60% accuracy!

> What good is speed if the accuracy isn't significantly better than a coin flip?

Because distinguishing an object as belonging to one class out of a thousand with 50% accuracy doesn't mean it's a coin flip. You'd need a thousand-sided coin. Random chance in that case is 0.1%, which maeks 50% way, way better.

The only issue with this comment is that it is _not_ what AP means for object detection... https://www.v7labs.com/blog/mean-average-precision

This is definitely not a coin flip, actually somehow close to what a human would produce, IMHO.

It’s more nuanced than this. It’s not “look at this image and tell me yes or no if there’s a car in it” it’s more like “tell me where all the cars are in this image, if any.” We use this a lot, and ramping up recall we can do some interesting use cases.

I assure you it’s highly useful in the real, real world.

And that's why nobody actually uses it for those things, at least not yet. Don't forget that advancement is often incremental, and that in this case advancement has actually been somewhat fast. YOLOv3 came out in 2018.