Hacker News new | ask | show | jobs
by YeGoblynQueenne 1288 days ago
>> Wow I just reset the thread, used the exact same input and got this (literally 60 seconds apart):

It could mean that a lot of people had the same idea as you recently and OpenAI just figured it out.

1 comments

It's neither. It's randomized. The system is not self-training in real time.
> The system is not self-training in real time.

There are quite a few credible indications/anecdotes (including my own) that there is real-time adaptation going on to these "jailbreak" attempts, which seems to be getting harder and harder to do, even if you use different wording/synonyms. And it seems to be happening gradually.

Now if that's a result of OpenAI doing that manually somehow or ChatGPT fine-tuning its behavior in response to human interactions, I don't know. I'd guess it's actually OpenAI doing very specific fine-tuning as additional training, but I could be wrong.

Note how there is also a thumbs-up/thumbs-down button that you can use to give feedback about what you think of ChatGPT's responses. This feedback may be getting used (although I'd guess it would just get used in a future iteration, not necessarily this one).