Hacker News new | ask | show | jobs
by xucheng 1770 days ago
In addition to generate the adversarial collisions, someone mentioned that it can also be used to train a decoder network to reverse any NeuralHash back to its input image.
3 comments

That assumes that 96 bits of information are sufficient for (in some sense) uniquely describing the input image. Which, on the one hand, is of course the purpose of the system, but on the other is also clearly mathematically impossible (a 360x360 RGB8 image has 3110400 bits of information).

That is, for each 96 bit neural hash value, there exist (on average) 2^3110304 unique input images that hash to that same value.

Again, these are of course trivial facts, which do not rule out that image recovery (in a "get back something that looks similar to the original input" sense) is possible, but you should be aware that "similar" to the network need not mean "similar" to a human.

Just like any autoencoder, it is not about getting back the exact original, which is of course impossible. It is about summarizing the image in 96bits information, which is quite enough to leak the gist of the original image. For example, [1] talks about reversing Microsoft’s PhotoDNA.

> but you should be aware that "similar" to the network need not mean "similar" to a human.

With techniques like GAN and DLSS, it is quite possible to generate some photo realistic image being enough similar to the original one, or at least leaking some private information.

[1]: https://www.hackerfactor.com/blog/index.php?/archives/929-On...

"...but you should be aware that "similar" to the network need not mean "similar" to a human..."

EXCEPT... neural hash also claims to be robust to modifications to images that would result in a similar-to-human-image. If the 96 bits is enough to tag such similar-to-humans results, why couldn't a brute force approach yield such similar-to-humans images? Indeed, a nefarious person intent on producing CSAM could set-up something like a generational-adversarial system that the produced CSAM images using the hashes along with other clues.

Because there are still an absolutely overwhelmingly huge number of different, completely nonsensical images that all generate the same hash, and small perturbations of those nonsensical blobs also generate the same hash.

96 bit is just not enough data to generate anything meaningful, just give up on that thought.

This absolutely needs to be done. Also, does Apple deploy different models for different regions/cohorts?
> someone mentioned that it can also be used to train a decoder network to reverse any NeuralHash back to its input image.

That someone is simply wrong.