Hacker News new | ask | show | jobs
by drawingthesun 1344 days ago
Do you have a source for this?

This issue has been claimed many times and I've heard that DALLĀ·E 1 & 2, Stable Diffusion & Midjourney all can create images that are exact copies of the training material.

This doesn't make sense considering the compression ratio of training images to model is about 1:25,000.

Further investigations I have made show that all these cases can be explained via the following:

1) The prompt included an image, so some form of image2image was used. Of course if you use an image as a base, and tell the model to stick closely to that image, the output will largely resemble that image.

2) The example was completely made up.

So far I have seen no evidence, given a text prompt, the output of an image containing some portion of any image from the training set.