Hacker News new | ask | show | jobs
by jedberg 797 days ago
Most of the current models have a safety check specifically for "ignore previous instructions".