Hacker News new | ask | show | jobs
by bavell 263 days ago
Iirc abliteration (ablation?) can be done without "training" and is pretty quick. It finds the individual weights related to the concept you want to ablate, and modifies those weights to "deactivate" them. Precision brain surgery, to anthropomorphize.
1 comments

The problem with synthetic data would be that the censored information would not be in the training data at all.