Hacker News new | ask | show | jobs
by stavros 1118 days ago
Reinforcement learning through human feedback.

Took me a bit of searching too.