Hacker News new | ask | show | jobs
by jmalicki 129 days ago
They ran out of passively collected data. RLHF allows them to gather deeper more targeted data.