by minimaxir 3330 days ago
> Also, what does "fully converged" signify?

In machine learning/deep learning, training loss shows strongly diminishing returns as training continues: eventually the model hits a point where the loss barely improves from one epoch/iteration to the next. "Fully converged" refers to that point. (Fun visualization from one of my projects: http://minimaxir.com/img/char-embeddings/epoch-losses.png)

In some cases, the loss can stop improving entirely, or increase.
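
One rough way to make "barely improves" concrete is a patience-based plateau check, similar in spirit to early stopping. The sketch below is only an illustration: the function name `epochs_until_converged`, the `min_delta` and `patience` thresholds, and the loss values are all made up for the example, not taken from any real training run.

```python
def epochs_until_converged(losses, min_delta=1e-3, patience=3):
    """Return the epoch index at which the loss has failed to improve on the
    best value seen so far by more than `min_delta` for `patience` epochs in
    a row, or None if that never happens."""
    best = float("inf")
    stale = 0  # consecutive epochs without a meaningful improvement
    for epoch, loss in enumerate(losses):
        if best - loss > min_delta:
            best = loss
            stale = 0
        else:
            stale += 1
            if stale >= patience:
                return epoch
    return None


# Hypothetical per-epoch losses: fast early progress, then a plateau.
losses = [2.30, 1.10, 0.70, 0.55, 0.50, 0.4995, 0.4993, 0.4996, 0.4994]
stop_epoch = epochs_until_converged(losses)
print(stop_epoch)
```

The same check also fires when the loss flatlines or starts increasing, since neither counts as an improvement over the best value so far. What counts as "converged" depends on the thresholds you pick, so treat the result as a heuristic rather than a definition.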


Any recs on choosing TensorFlow vs Keras-on-TensorFlow?
Start with Keras. If you run into something you want to do that Keras does not support, drop down into TensorFlow. They are not mutually exclusive, and all of TensorFlow is still available.
Fantastic. Thanks to you and minimaxir for the guidance.
Keras does not add much overhead, if any, and yes, it is as easy as everyone claims.