Hacker News
by mrinterweb 2 days ago
It kind of sucks, but I get the silent change. If a user is trying to use the model for something untoward, an explicit rejection would just give them a signal to iterate against until they eventually bypass the security measures.