| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by swid 1206 days ago
	It's used for fine tuning a pre-trained model. This takes an LLM that is already capable of emulating lots of different kinds of personalities, and narrows it down to act more like the examples. Since the heavy lifting has already been done, 15k examples of a chatbot following instructions they way you want has a significant effect.