Hacker News
by jagraff 1207 days ago
The most interesting part is that the author can "coerce" GPT into giving the completely opposite answer just by requiring the first token to be Yes or No, and that it sometimes finds ways to skirt around that constraint without technically breaking the rule.
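
For anyone wondering what "requiring the first token" could look like mechanically, here is a rough sketch. It is not the author's method (the article may simply have stated the rule in the prompt); it assumes you can see the model's first-token scores and mask everything except Yes/No before picking. The function name `constrain_first_token` and the toy scores are made up for illustration.

```python
import math

def constrain_first_token(logits, allowed=("Yes", "No")):
    """Mask every token outside `allowed`, then renormalise with a softmax.

    `logits` is a hypothetical mapping of candidate first tokens to raw scores.
    """
    if not any(tok in allowed for tok in logits):
        raise ValueError("none of the allowed tokens appear in the vocabulary")
    # Disallowed tokens get -inf, so they end up with probability 0.
    masked = {tok: (score if tok in allowed else -math.inf)
              for tok, score in logits.items()}
    peak = max(masked.values())  # subtract the max for numerical stability
    weights = {tok: math.exp(score - peak) for tok, score in masked.items()}
    total = sum(weights.values())
    return {tok: w / total for tok, w in weights.items()}

# Made-up scores: left alone, the model would rather open with a hedge ("It", "Well").
toy_logits = {"It": 2.0, "Well": 1.5, "Yes": 0.2, "No": 0.9}
probs = constrain_first_token(toy_logits)
first_token = max(probs, key=probs.get)
```

Only the first token is constrained here. Everything after it is still free text, which is presumably where the skirting-around happens.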