And serving is not training. For distilling you need to train the big models to have something to be distilled.