| HN Mirror



	by cubefox 1189 days ago
	The model was only instruction-tuned. That is, it can answer questions and follow instructions. Unless specifically prompted, a pure predictor would often not respond to instructions but instead continue or elaborate on them. To get specific types of responses, you would need something like RLHF.