by sashank_1509
1207 days ago
They seem to be testing only on the image retrieval task, but I don't think CLIP is actually used for image retrieval. In most cases, I see CLIP being used for semantic segmentation, detection, etc. Do these guys have similar results on those tasks?