Y
Hacker News
new
|
ask
|
show
|
jobs
by
bradfa
212 days ago
Got any links to explanations of why fine tuning open models isn’t a productive solution? Besides renting the GPU time, what other downsides exist on today’s SOTA open models for doing this?
1 comments
RC_ITR
210 days ago
When the new pre-trained parameters come out in a new model generation, your old fine tuning doesn't apply to them.
link