| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by thaw13579 1088 days ago
	It’s more than next-word prediction though. The supervised fine tuning and RLHF steps are ways to possibly train it to favor truthful answers. Not sure whether this is currently the emphasis of ChatGPT though…