Unsloth doesn't have an official multi-GPU story: there are hacked-together solutions, but they're finicky even for smaller models.
In general, there are very few resources on finetuning DeepSeek, and things get muddied further by people who claim to be finetuning it when they're really talking about the distills.
I've been trying to actually finetune DeepSeek (not the distills), and there are few options.