| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by sashank_1509 1143 days ago
	They used to offer exactly this for fine tuning models. Never offered it after ChatGPT, I think the difficulty comes with fine tuning RLHF models, not obvious how to correctly do this.