|
|
|
|
|
by vov_or
1203 days ago
|
|
Hi!
I am one of the contributors!
We were focused on image retrieval only. Almost all semantic search engines for images are based on CLIP today. We are also building a semantic multimodal search engine as a DBMS component. That is why Image retrieval is so crucial for us as well as inference perf.
Also, for semantic segmentation and detection, you probably use only the image encoder part of the CLIP. |
|