by int_19h 418 days ago
The "meaningless praise" part is basically American cultural norms trained into the model via RLHF. It can be largely negated with careful prompting, though.