	by stefanwebb 241 days ago
	Seems topical given some recent front-page HN articles on fine-tuning. I discuss a large-scale empirical study from 2024 of fine-tuning 7B models to outperform GPT-4 and GPT-3.5-Turbo, as well as arguments for why fine-tuning is coming back into favor.