Y
Hacker News
new
|
ask
|
show
|
jobs
by
vhcr
197 days ago
I'm guessing you're not storing the CLIP for every single frame, instead of every second or so? Also, are you using the cosine similarity? How are you finding the nearest vector?
1 comments
laidoffamazon
197 days ago
I split per scene using pyscenedetect and sampled from each. Distance is via cosine similarity- I fed it into qdrant
link