Y
Hacker News
new
|
ask
|
show
|
jobs
Reinforcement Learning from Human Feedback: When the Math Ain't Enough
(
evalovernite.substack.com
)
1 points
by
scoresmoke
1042 days ago