| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by embedding-shape 138 days ago
	> We need models that keep on learning (updating their parameters) forever, online, all the time. Do we need that? Today's models are already capable in lots of areas. Sure, they don't match up to what the uberhypers are talking up, but technology seldom does. Doesn't mean what's there already cannot be used in a better way, if they could stop jamming it into everything everywhere.

1 comments

pankajdoharey 138 days ago

Continuous learningin current models will lead to catastrophic forgetting.

link

DoctorOetker 137 days ago

will catastrophic forgetting still occur if a fraction of the update sentences are the original training corpus?

is the real issue actually catastrophic forgetting or overfitting?

nothing prevents users from continuing the learning as they use a model

link

thesz 137 days ago

Catastrophic forgetting is overfitting.

link

pankajdoharey 131 days ago

No, it’s actually the math of overwriting. Imagine you hiked down into a valley Task A and settled there. Then, you decide to climb a new mountain to find a different valley Task B. You successfully move to the new valley, but in doing so, you destroy the path back to the first one. You are now stuck in the new valley and have completely 'forgotten' how to get back to the first one.

link

davidguetta 136 days ago

not exactly, not at all even in term of the way the llm are trained.

In RL it can be that you are not getting meaningful data anymore because you are 'too good' and dont get anymore the "this is a bad answer" signal so you can't estimate the gradient.

link