by vhcr 197 days ago
I'm guessing you're not storing the CLIP embedding for every single frame, but rather one every second or so? Also, are you using cosine similarity? How are you finding the nearest vector?

I split the video per scene using PySceneDetect and sampled frames from each scene. Distance is cosine similarity; I fed the embeddings into Qdrant.
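
For anyone curious what that looks like in miniature, here is a rough stdlib-only sketch, not my actual code. It assumes the scene boundaries are already available as (start, end) frame pairs with an exclusive end, and it does a brute-force cosine search where a vector database would use an index. The names `sample_frames`, `cosine_similarity`, and `nearest` are made up for illustration, and the tiny 2-D vectors stand in for real CLIP embeddings.

```python
import math


def sample_frames(scenes, per_scene=3):
    """Pick up to `per_scene` evenly spaced frame indices from each scene.

    `scenes` is a list of (start, end) frame pairs; `end` is exclusive.
    """
    picks = []
    for start, end in scenes:
        length = end - start
        n = min(per_scene, length)
        # Split the scene into n equal slices and take the middle of each.
        picks.extend(start + (2 * i + 1) * length // (2 * n) for i in range(n))
    return picks


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def nearest(query, vectors, k=1):
    """Return the k (key, score) pairs most similar to `query`, best first.

    Brute force over every stored vector; fine for a demo, too slow at scale.
    """
    scored = sorted(
        ((cosine_similarity(query, vec), key) for key, vec in vectors.items()),
        reverse=True,
    )
    return [(key, score) for score, key in scored[:k]]


if __name__ == "__main__":
    # Hypothetical scene boundaries and toy 2-D "embeddings".
    print(sample_frames([(0, 100), (100, 102)], per_scene=2))
    index = {"a": [1.0, 0.0], "b": [0.0, 1.0], "c": [1.0, 1.0]}
    print(nearest([2.0, 0.1], index, k=2))
```

Sampling per scene instead of per second keeps the number of stored vectors proportional to how often the picture actually changes, which is the main reason to detect scenes first.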