That sounds really cool, but coming from training other statistical models, im having a hard time imagining what the training loop looks like.