|
|
|
|
|
by dasbo
2538 days ago
|
|
It's best not to think in GB terms when talking about AD datasets. E.g., when you record raw data of a multisensor setup (lidars, radars, cameras), the data rate can reach 10+ TB/h. Camera-only datasets are in comparison much smaller. Taken out of the argoverse dataset description:
- One dataset with 3D tracking annotations for 113 scenes
- One dataset with 327,793 interesting vehicle trajectories extracted from over 1000 driving hours
- Two high-definition (HD) maps with lane centerlines, traffic direction, ground height, and more 1000 driving hours is ok'ish for research (imho). |
|
Yes, it's certainly not yet clear if the dataset is large enough to capture useful variance. But unlike kitti or cityscapes, it's large enough to present a computational challenge to most of the machines & budgets in research use today, so there's a pretty good chance it'll help push the state of the art... Perhaps for more than one art. The API code itself has a lot of low-hanging fruit: https://github.com/argoai/argoverse-api
nuscenes is good too https://www.nuscenes.org/
Waymo will have one as well some day https://waymo.com/open/
One attractive aspect of these datasets is that they help open up the question of safety for public discussion. For example, now anybody can throw off-the-shelf object detection at these datasets and see what a realistic F1-score looks like for objects at 30m, 50m, 100m, etc...
In argoverse and nuscenes, you have track labels, so you can furthermore factor the velocity difference into how you weigh the error. Have you ever been hit by an Uber? Even 5mph can cause a lot of damage.