|
|
|
|
|
by ACCount37
251 days ago
|
|
"Unwillingness to be harsh to the user" is a major source of "divorce from reality" in LLMs. They are all way too high on the agreeableness, likely from RLHF and SFT for instruction-following. And don't get me started on what training on thumbs up/thumbs down user feedback does. |
|