There’s a similar rite of passage for Deep Learning engineers using Python and Keras: every single one of us has been burned by waiting a week for a model to train, only to find that “h5py” is not installed, so an exception is thrown and all the work is lost. (That’s the Python module used to persist the model weights.)

Then the young grasshoppers learn about checkpointing and about using our DevOps systems to make sure the environment is up to spec, but I feel this has to happen at least once to each Deep Learning researcher.
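For anyone who would rather skip the week of waiting, here is a rough sketch of both habits, not a definitive recipe. The helper names (`missing_modules`, `preflight`, `make_checkpoint_callback`) and the checkpoint filename are made up for illustration; the only library calls are `importlib.util.find_spec` from the standard library and Keras's `ModelCheckpoint` callback. The filename suffix assumes a recent Keras version and may need adjusting on older ones.

```python
import importlib.util


def missing_modules(names):
    """Return the top-level module names that cannot be found in this environment."""
    # find_spec returns None for a top-level module that is not installed.
    return [name for name in names if importlib.util.find_spec(name) is None]


def preflight(names=("h5py",)):
    """Fail fast, before any training starts, if a required module is absent."""
    missing = missing_modules(names)
    if missing:
        raise RuntimeError("Missing modules: " + ", ".join(missing))


def make_checkpoint_callback(path="weights.{epoch:02d}.weights.h5"):
    """Build a Keras callback that saves the weights at the end of every epoch.

    Keras is imported lazily so the preflight check itself has no
    third-party dependencies. The default path is a hypothetical example.
    """
    from keras.callbacks import ModelCheckpoint

    return ModelCheckpoint(filepath=path, save_weights_only=True)


# Intended use (model and data are whatever you already have):
#   preflight(("h5py", "keras"))
#   model.fit(x, y, epochs=50, callbacks=[make_checkpoint_callback()])
```

With the check at the top of the training script, a missing module costs a few seconds instead of a week, and the per-epoch checkpoints mean a late crash loses at most one epoch of work.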