| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by bradfa 212 days ago
	Got any links to explanations of why fine tuning open models isn’t a productive solution? Besides renting the GPU time, what other downsides exist on today’s SOTA open models for doing this?

1 comments

When the new pre-trained parameters come out in a new model generation, your old fine tuning doesn't apply to them.