Hacker News new | ask | show | jobs
by forthorbor 1284 days ago
Rather than hardcoded exceptions, they should train the model to recognise when someone is attempting a harmful prompt.