I wrote a DB for PNG files that deduplicates PNG blocks (exact matches only): https://github.com/albertz/png-db
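To give a rough idea of what exact-match deduplication can look like, here is a minimal Python sketch. It is not taken from png-db: it assumes a "block" is a raw PNG chunk (length, type, data, CRC) and uses an in-memory dict keyed by SHA-256, and the names `iter_chunks` and `ChunkStore` are made up for this illustration. The actual project may split and store the data differently.

```python
import hashlib
import struct

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def iter_chunks(png_bytes):
    """Yield each raw PNG chunk (length + type + data + CRC) in file order."""
    if png_bytes[:8] != PNG_SIGNATURE:
        raise ValueError("not a PNG file")
    pos = 8
    while pos < len(png_bytes):
        # Each chunk starts with a 4-byte big-endian length of its data field.
        (length,) = struct.unpack(">I", png_bytes[pos:pos + 4])
        end = pos + 12 + length  # 4 (length) + 4 (type) + data + 4 (CRC)
        yield png_bytes[pos:end]
        pos = end


class ChunkStore:
    """Hypothetical in-memory store: identical chunks are kept only once."""

    def __init__(self):
        self.chunks = {}  # SHA-256 hex digest -> raw chunk bytes
        self.files = {}   # file name -> ordered list of chunk digests

    def add(self, name, png_bytes):
        keys = []
        for chunk in iter_chunks(png_bytes):
            key = hashlib.sha256(chunk).hexdigest()
            # Only byte-identical chunks share a digest, so only exact
            # matches are deduplicated.
            self.chunks.setdefault(key, chunk)
            keys.append(key)
        self.files[name] = keys

    def get(self, name):
        """Rebuild the original file from its stored chunk list."""
        return PNG_SIGNATURE + b"".join(self.chunks[k] for k in self.files[name])
```

With this layout, two images that share, say, the same header or palette chunk store that chunk once, and `get` returns each file byte for byte.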