Hacker News new | ask | show | jobs
by PreciousH 83 days ago
I think group dynamics comes with a turn taking ambiguity. unlike in one-on-one dialogue that's structurally clean since there's a clear prompt, a clear response, and a clear feedback signal for RLHF.
1 comments

Sure, messy to implement. But maybe that messiness is the fix. Clean 1-on-1 is exactly why AI learns to flatter — one voice, one signal, no pushback. Group is harder to train but harder to game