Hacker News new | ask | show | jobs
by freeamz 422 days ago
Interesting. How does this compare to abliteration of LLM? What are some 'debug' tools to find out the constrain of these models?

How does pasting a xml file 'jailbreaks' it?