Hacker News
by ziddoap 618 days ago
>RLHFed

For those of us not steeped in AI culture, this appears to be short for "Reinforcement learning from human feedback".