|
|
|
|
|
by hm-nah
784 days ago
|
|
A jailbreak doesn’t “make a model do something actually bad”. A jailbreak makes it trivial to “provide a human who wishes to do bad, the info needed to be successful”. Depending on the severity of the info and the diligence of the human, by the time you “see evidence of a real threat”, you could be enjoying a nice sip of the tainted municipal water supply. This ain’t a joke. |
|
Yes it is. Libraries and the internet have made finding 'harmful" instructions trivial for decades, if not centuries.