Hacker News new | ask | show | jobs
by chmod775 477 days ago
The Venn diagram of people to whom this comment contains no new information and those who know what "RLHF" means is almost a perfect circle.

For anyone not part of that intersection: RLHF means reinforcement learning from human feedback.