|
|
|
|
|
by pash
17 days ago
|
|
The obvious solution is to run things in reverse, inputting the AI-generated output to recover the prompt that generated it. Most generative models can be run in reverse by algorithms that already exist [0], but you have to have the model weights. For closed-weight models, or for a process that can handle unknown models, you’d have to do some engineering. But do we have the technology to build models that back out the prompt from suspected AI output? Yes. 0. I don’t mean that most neural networks are invertible functions. They’re not. But you can do backprop in reverse, from output to input, to train a model to generate an input to the original model that best predicts its output. |
|
What prompt constructs the output ‘The answer is 3’ or ‘Yes that’s a great idea’?