Hacker News new | ask | show | jobs
by TJSomething 1248 days ago
One of the important parts of ChatGPT over plain GPT-3 is the reinforcement learning from human feedback to ensure alignment, without which it's not quite as good of a product for the public.