Hacker News new | ask | show | jobs
by yeldarb 2443 days ago
Has anyone done this yet? I'd be curious to know if their dataset is "cleaned" or not.
1 comments

Spoiler, ish:

The last time I measured ImageNet JPEGs with EXIF orientation metadata, the number of affected images was actually quite small (< 100, out of a dataset of 1.28M). There are also some duplicates, but altogether it seems fairly "clean."