Hacker News new | ask | show | jobs
by manscrober 283 days ago
a) 2022 is not too long ago b) this was a first important step to usable ai but not scalable. I'd say "RL training" is not the same as RLHF.