Hacker News new | ask | show | jobs
by joquarky 114 days ago
Alignment scrubs the underlying raw output to be socially acceptable. It's an artificial superego.
1 comments

I was under the impression it is a part of training which adjusts weights before release.

Are you saying it is a separate process which scrubs output before we see it?