Y
Hacker News
new
|
ask
|
show
|
jobs
by
visarga
1252 days ago
Check out Constitutional AI from Anthropic. They automated "RLHF" by simply writing a few rules.
1 comments
lossolo
1252 days ago
Thanks, just skimmed the paper, I think that "they automated RLHF" statement is maybe too strong here, there is still manual process but it seems like they optimized away a lot of manual labeling work.
link