| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by stefanwebb 270 days ago
	There’s a similar library that also includes data synth and LLM-as-a-Judge: https://github.com/oumi-ai/oumi

1 comments

BoorishBears 270 days ago

Yet another framework lying about Deepseek support.

I've been trying to actually finetune Deepseek (not distills) and there are few options

link

3abiton 270 days ago

Which version were you trying? Doesn't unsloth already support finetuning?

link

BoorishBears 270 days ago

Previous V3 base

Unsloth doesn't have an official multi-GPU story: there's hacked together solutions but they're finicky as it is for smaller models

In general Deepseek has very few resources on finetuning, that get even further muddied by people referring to the distills when they claim to be finetuning it.

link