|
|
|
|
|
by RC_ITR
1057 days ago
|
|
RLHF does change the parameters. The way to think about it is that backpropagation changes the parameters of a model so they get closer to some sort of desired output. In pre-training and SFT, the parameters are changed so the model does a better job of replicating the next word in the training data, given the words it has already seen. In RLHF, the parameters are changed so the model does a better job of outputting the response that aligns to the human's preference (see: the feedback screen in the linked article). |
|
So how can you update weights without doing back-propagation? Or is it still back propagation but with a different metric?