| HN Mirror

by mdale 1202 days ago

Nothing prevents a robust hierarchy of rules and training that sets permission levels according to operator intent.

OpenAI has made a lot of progress on this in a very short amount of time. Casual jailbreaking or negative role-playing via the ChatGPT chat interface is already 100x more difficult than in early versions.

We will see more sophisticated and robust adversarial filters for untrusted content going forward.

1 comment

TisButMe 1202 days ago

Possibly, yes. I think that's my point about peak oil being predicted wrong for 50 years. Still, right now it seems that every time OpenAI or someone else adds a new content filter, someone figures out a prompt escape that works.
