Hacker News new | ask | show | jobs
by visarga 1252 days ago
Check out Constitutional AI from Anthropic. They automated "RLHF" by simply writing a few rules.
1 comments

Thanks, just skimmed the paper, I think that "they automated RLHF" statement is maybe too strong here, there is still manual process but it seems like they optimized away a lot of manual labeling work.