Hacker News
by fc417fc802 60 days ago
So you're saying that blackhats will be required to do a small bit of roleplay if they want the model to assist them? I'm not against public access, BTW; I'm just pointing out how absurd that PR-oriented "safety" feature is. It's a "we did something, don't blame us" sort of measure.

It isn't even my intent to naysay their approach. They probably have to do something along those lines to avoid being convicted in the court of public opinion. I just think it's an absurd reality.

2 comments

It's a liability shield, and it helps to avoid unsavory headlines in the news.
The role-play makes it harder to fully automate attacks, which is the real fear.
I bet you can use an ungated model as an intermediary to do the role-playing job for you.