|
|
|
|
|
by jotaf
1752 days ago
|
|
Thanks, but I think that lucb1e's confusion was probably the same as mine -- given pretrained CLIP features, how is this translated to zero-shot tracking? Are initial bounding boxes given as usual, or are objects of interest created automagically? Or are they tracked just from text descriptions? Lots of questions after reading the post :) |
|
This is in comparison to the original Deep SORT[3] which requires you to train a second custom "deep appearance descriptor" model for the tracker to use.
[1] https://github.com/roboflow-ai/zero-shot-object-tracking
[2] https://universe.roboflow.com
[3] https://github.com/nwojke/deep_sort