Hacker News new | ask | show | jobs
Breaking RLHF “Safety” (And how to fix it?) (lesswrong.com)
2 points by maxmusing 1022 days ago