Hacker News new | ask | show | jobs
by harperlee 1375 days ago
> If there was a Borgesian book containing, say, all possible 512x512 images, one on each page, then surely two people having the same copy of that book wouldn't need to exchange images, they could simply exchange page numbers, and see exactly what the other one is referring to.

If that were the case, the page number would be as long as the image, and exchanging one or the other is the same. Heck with proper order arrangement, the page number IS the image, in a known format.

In other words, the information content of such book is exactly 0.

1 comments

What could contain information is a book of all 512x512 images that a human being would perceive as being "an image". Ie, the vast majority of possible 512x512 images look like random noise to humans. Excluding those massively shrinks the size of the book.

So that does mean image model AI like dall-e/SD are effectively compression over this space of images that they can generate (which is at least attempting to emulate the space of 'meaningful images' to a human), since given a seed, they'll deterministically produce the same image, and that seed is much smaller than the information needed to describe every pixel in the image.

Ah yes, conceptually, SD is a filter that removes noise (~ things that look like noise to us) from the space of all possible pixel combinations. It's interesting that it's also how it works in practice.