Hacker News
Reinforcement Learning from Human Feedback: When the Math Ain't Enough (evalovernite.substack.com)
1 point by scoresmoke 1042 days ago