by stefanwebb 270 days ago
There’s a similar library that also includes data synth and LLM-as-a-Judge: https://github.com/oumi-ai/oumi

Yet another framework lying about Deepseek support.

I've been trying to actually finetune Deepseek (not the distills), and there are few options.

Which version were you trying? Doesn't Unsloth already support finetuning?
The previous V3 base.

Unsloth doesn't have an official multi-GPU story: there are hacked-together solutions, but they're finicky even for smaller models.

In general, Deepseek has very few resources on finetuning, and those get muddied even further by people referring to the distills when they claim to be finetuning it.