Hacker News new | ask | show | jobs
by kitd 1 day ago
Funny that coding agents have personalities, including "that colleague" you want to avoid even if you know they're probably quite good at what they do!
1 comments

That's exactly what RLHF is for.

(In fact, "that colleague" might have even been the source of the RLHF training set.)