| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by littlestymaar 826 days ago
	By default it's just going to be a text completion model, you want an additional round of training to make it behave like a chatbot. I guess you could probably get away with just fine-tuning on chatbot discussions, but everybody uses RLHF so I guess it must be much more efficient for that.