Hacker News
by ggm 509 days ago
How did you go with duplicate detection? (For context: aside from being generally very useful to anyone with non-trivially edited images, lots of hotel chains reuse content.)

Perceptual hashes are very good for that, maybe with some adjustments for mirrored images and some crops
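For what the mirrored-image adjustment could look like, here is a minimal sketch. It assumes the image has already been decoded and downscaled to a small grayscale grid (a list of rows of 0-255 values). The names `average_hash`, `hamming` and `mirror_aware_distance` are made up for illustration, and average hash is used only because it is about the simplest perceptual hash, not because anyone in this thread used it.

```python
def average_hash(grid):
    """Bit tuple: 1 where a pixel is brighter than the grid's mean, else 0."""
    pixels = [p for row in grid for p in row]
    mean = sum(pixels) / len(pixels)
    return tuple(1 if p > mean else 0 for p in pixels)


def hamming(a, b):
    """Number of bit positions where two equal-length hashes differ."""
    return sum(x != y for x, y in zip(a, b))


def mirror_aware_distance(grid_a, grid_b):
    """Smallest distance between A and either B or B flipped left-to-right."""
    flipped_b = [row[::-1] for row in grid_b]
    hash_a = average_hash(grid_a)
    return min(
        hamming(hash_a, average_hash(grid_b)),
        hamming(hash_a, average_hash(flipped_b)),
    )


# Tiny made-up grids: `mirrored` is `original` flipped horizontally.
original = [[10, 200], [30, 220]]
mirrored = [[200, 10], [220, 30]]
plain = hamming(average_hash(original), average_hash(mirrored))
adjusted = mirror_aware_distance(original, mirrored)
```

The plain comparison treats the mirrored copy as a different image, while the flip-aware one matches it. It costs one extra hash per comparison, or one extra stored hash per image if you precompute.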
You can use dhash, which is more robust to compression, crops and color changes. It mainly looks at the features in the image.
The README explicitly says it's not good for crops, and from the algorithm description it can't be. Or is there another thing called dhash with a different algorithm? I'm looking at https://github.com/benhoyt/dhash (https://github.com/Rayraegah/dhash is similar but horizontal only).
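To make the crop point concrete, here is a rough sketch of a horizontal-only difference hash. It is not code from either repo. It assumes the image is already grayscale and shrunk to a small fixed grid, and the bit convention (1 when brightness rises left to right) and the names `dhash_rows` and `hamming` are illustrative assumptions.

```python
def dhash_rows(grid):
    """One bit per horizontally adjacent pixel pair: 1 if the right pixel is brighter."""
    bits = []
    for row in grid:
        for left, right in zip(row, row[1:]):
            bits.append(1 if right > left else 0)
    return bits


def hamming(a, b):
    """Number of bit positions where two equal-length hashes differ."""
    return sum(x != y for x, y in zip(a, b))


# Made-up 2x3 grids standing in for downscaled images.
base = [[1, 2, 3], [3, 2, 1]]
brighter = [[p + 50 for p in row] for row in base]  # uniform brightness change
shifted = [[2, 3, 1], [2, 1, 3]]                    # content moved between cells

same = hamming(dhash_rows(base), dhash_rows(brighter))
moved = hamming(dhash_rows(base), dhash_rows(shifted))
```

Only the sign of each neighbour difference is kept, so a uniform brightness change leaves the hash untouched. Anything that moves content between grid cells flips bits, and as far as I can tell that is what a crop does once the cropped image is rescaled to the same fixed grid.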
Yes. Most of the systems I used a couple of years ago (immich for instance) were still quite immature.