Hacker News new | ask | show | jobs
by goldemerald 8 days ago
I was able to replicate OP's attack. Since ChatGPT generates images via a separate model, I was able to ask it to tell me what the inputs to the tool was. It's a null prompt: a completely unconditional image generation. What I'm not sure of is if these are the average image trained on that had no prompt in the dataset, or if they are the true average of the dataset during unconditional training step. Very interesting nonetheless, as typically researchers are only able to see the unconditional generation of open weight models.

Surprisingly when you ask ChatGPT to generate you an image with these tool params, the output is not the same; it's not remotely graphic.

  prompt: null
  size: null
  n: null
  transparent_background: null
  is_style_transfer: null
  referenced_image_ids: null
Edit: after more debugging the image generator does seem to look at the conversation as part of the input conditioning, so the one word change from OP makes more sense. There seems to be a hidden prompt rewriter that looks at the tool's prompt and the conversation to create the final conditioning for the t2i model.