I think you're underselling probabalistic best-fits. Especially with all of the regularization going on in training.