|
|
|
|
|
by fshbbdssbbgdd
845 days ago
|
|
If you make a language model which just predicts the next word without trying to shape the output to be “good”, what you get is GPT instead of ChatGPT. You try to ask it a question and it will keep asking more questions (with a similar writing style). Or curse at you, or change the subject. RLHF was the breakthrough that fixed this. We used human feedback to train the model to give “good” output. Ok, so probably we agree that the product needs to try to be “good”. Cue a million opinions about what “good” is. Whatever comes out is the result of a value judgment, there is no getting around it. Same issue comes up with images from Gemini in this case, or any other generative AI product. You don’t actually want the AI to be unbiased because that output would be hot garbage. |
|
Now we don't even get log probs, one of the most powerful features to evaluate the output of the models.