Y
Hacker News
new
|
ask
|
show
|
jobs
by
thaw13579
1088 days ago
It’s more than next-word prediction though. The supervised fine tuning and RLHF steps are ways to possibly train it to favor truthful answers. Not sure whether this is currently the emphasis of ChatGPT though…