	by sebastiennight 740 days ago
	This is the result of RLHF (fine-tuning that makes the output more palatable to humans), not of pretraining itself. Pretraining optimizes the model to produce the likeliest continuation of its input, and the introduction phrase you quoted would not come out of that process without RLHF. Compare base GPT-3 (davinci via the API), which had no RLHF and would not say this, with ChatGPT, which is fine-tuned for human preferences and thus outputs such giveaways.