Hacker News new | ask | show | jobs
by bitforger 1434 days ago
We do this with Pinecone, but we use CLIP embeddings of images, and they work incredibly well. It's kind of crazy how easy it is to get semantic search of images these days.

CLIP also does caption embeddings, so you can lookup images via both images and captions.

2 comments

Seconding the recommendation of CLIP embeddings, especially compared to image histograms + requiring OpenCV.

I wrote a naive, minimal dependency Python package to calculate image embeddings (https://github.com/minimaxir/imgbeddings) with some lookup demo notebooks and it works well in a pinch, although it's due for an upgrade.

Hey! We love CLIP and plan to cover it later in this series.