Hacker News new | ask | show | jobs
Explaining Reinforcement Learning with Human Feedback (RLHF) (surgehq.ai)
11 points by echen 1263 days ago