Hacker News
by hannesfur 442 days ago
You mean fine-tuning that feels like SFT but works differently (since you can't use SFT with reasoning models), built around the DeepSeek class of models?
1 comment

I just want to fine-tune DeepSeek V3 chat, but it's not possible or easy for regular consumers.