by chmod775
477 days ago
The Venn diagram of people to whom this comment contains no new information and those who know what "RLHF" means is almost a perfect circle. For anyone not part of that intersection: RLHF means reinforcement learning from human feedback.