|
|
|
|
|
by jangerhofer
3330 days ago
|
|
New-comer to the field here with a few questions. Do I understand correctly that a checkpoint is just a snapshot of the model at a point in time? i.e. "Here are the probabilities of each outcome given the characteristics I have observed already." Also, what does "fully converged" signify? Are there points in the course of training the model at which it is more appropriate to "save" progress than at other times? |
|
In machine learning/deep learning, the decrease in training loss has major diminishing returns as training continues. Eventually, training the model hits a point where the loss barely improves each epoch/iteration. (fun visualization from one of my projects: http://minimaxir.com/img/char-embeddings/epoch-losses.png)
In some cases, the loss can stop improving entirely, or increase.