Hacker News (HN Mirror)


	by taneq 1002 days ago
	That’s what the fine-tuning is about. The model learns the language, concepts, etc. from the main dataset and is then tweaked by continuing to train on a smaller, high-quality, hand-curated dataset. That’s how it learns to generate conversational responses by default instead of needing a complicated prompt.
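
To make the two stages concrete, here is a toy sketch in Python. It is only an analogy, not how a real LLM is trained: a bigram count table stands in for the network's weights, and the tiny corpora, the `train` and `generate` helpers, and the `weight=5` setting (a rough stand-in for extra passes over the curated data) are all made up for illustration. The point it tries to show is that the second stage keeps updating the same model, so its default output shifts while what it learned earlier stays.

```python
import copy
from collections import Counter, defaultdict


def train(model, corpus, weight=1):
    """Continue training `model` in place on `corpus` (a list of token lists).

    The model is a table of next-token counts; `weight` is a hypothetical
    knob standing in for how strongly this data should pull on the model.
    """
    for tokens in corpus:
        for prev, nxt in zip(tokens, tokens[1:]):
            model[prev][nxt] += weight
    return model


def generate(model, start, max_tokens=4):
    """Greedy decoding: always append the most frequent next token."""
    out = [start]
    while len(out) < max_tokens and model[out[-1]]:
        out.append(model[out[-1]].most_common(1)[0][0])
    return out


# Stage 1: "pretraining" on a larger, uncurated corpus (made-up data).
# Most of the time a question is followed by chatter, not an answer.
pretrain_corpus = (
    [["Q:", "same", "problem", "here"]] * 8
    + [["Q:", "try", "this", "fix"]] * 2
)
base_model = train(defaultdict(Counter), pretrain_corpus)

# Stage 2: "fine-tuning" continues from the pretrained state on a small,
# hand-picked set where a question is always followed by an answer.
curated_corpus = [["Q:", "try", "this", "fix"]] * 3
tuned_model = train(copy.deepcopy(base_model), curated_corpus, weight=5)

print(generate(base_model, "Q:"))   # ['Q:', 'same', 'problem', 'here']
print(generate(tuned_model, "Q:"))  # ['Q:', 'try', 'this', 'fix']
```

After stage 2 the bare prompt `Q:` gets an answer by default, while the counts learned in stage 1 (for example `same` followed by `problem`) are still in the table.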