by neuroticnews25 351 days ago
That would make Grok the only model capable of protecting its real system prompt from leaking?
1 comment

Well, for this version people have only been trying for a day or so.
Providing a fake system prompt would make such jailbreaking very unlikely to succeed unless the jailbreak prompt explicitly accounts for that particular instruction.