by KurSix 85 days ago
Just go look on Hugging Face. It's packed with uncensored models from the Dolphin Llama 3 70B family that will happily write you a recipe for napalm while swearing like a sailor. Meta's guardrails lasted exactly one week before the community figured out abliteration - a method that finds the refusal direction in the model's activations and surgically projects it out of the weights, without even needing a fine-tune.