|
|
|
|
|
by DrFrugal
552 days ago
|
|
even though I do not like it too much, i think i will have to pry apart the MPQ files into it's distinct parts, and write them down to the filesystem individually (and then deleting the original file) - basically what i wanted to do with the extents, but instead have them as distinct files.
this then can be reversed via script to assemble the original archive file on demand to get a byte-by-byte equal file again.
writing the parts down to the filesystem will cause the parts to be properly block aligned, and be able to be hardlinked, if they exist multiple times on the filesystem - this cuts down on metadata even more and also boosts performance when doing block/extent deduplication, since a single inode is only processed once in most proper deduplication programs. the MPQ files range from a few MiB to around 2.5 GiB.
since the access should be rather fast, pairing them as an archive file is not an option for me. thanks about the hint of the merkle trees, i will read up on what that is... always good to know about different approaches to a problem :) |
|