|
|
|
|
|
by seafoamteal
414 days ago
|
|
It does feel like they've dialed up the model's tendency to agree with users and are dialing down the safety. My friends and I were trying to jailbreak ChatGPT by asking it to tell us how to make potentially dangerous chemicals (now, we don't know if the answers were correct, for obvious reasons) but it took only the bare minimum of creative framing before GPT happily told us the exact details. We didn't even try anything new. Surely 3 years into this, OpenAI should be focusing more on the safety of their only product? |
|