| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by jangerhofer 3330 days ago

New-comer to the field here with a few questions.

Do I understand correctly that a checkpoint is just a snapshot of the model at a point in time? i.e. "Here are the probabilities of each outcome given the characteristics I have observed already."

Also, what does "fully converged" signify? Are there points in the course of training the model at which it is more appropriate to "save" progress than at other times?

1 comments

minimaxir 3330 days ago

> Also, what does "fully converged" signify?

In machine learning/deep learning, the decrease in training loss has major diminishing returns as training continues. Eventually, training the model hits a point where the loss barely improves each epoch/iteration. (fun visualization from one of my projects: http://minimaxir.com/img/char-embeddings/epoch-losses.png)

In some cases, the loss can stop improving entirely, or increase.

link

carbocation 3330 days ago

Any recs on choosing TensorFlow vs Keras-on-TensorFlow?

link

jacquesm 3329 days ago

Start with Keras, if you run into something you want to do that is not supported by Keras drop into TensorFlow, they are not mutually exclusive and all of TensorFlow is availble.

link

carbocation 3329 days ago

Fantastic. Thanks to you and minimaxir for the guidance.

link

minimaxir 3330 days ago

Keras does not add much overhead, if any, and yes, it is as easy as everyone claims.

link