Y
Hacker News
new
|
ask
|
show
|
jobs
Explaining Reinforcement Learning with Human Feedback (RLHF)
(
surgehq.ai
)
11 points
by
echen
1263 days ago