Hacker News new | ask | show | jobs
by oezi 37 days ago
Would it be feasible to do a soft RLHF using steering when an agents gives an undesired response?