Y
Hacker News
new
|
ask
|
show
|
jobs
by
hannesfur
442 days ago
you mean fine tuning that feels like SFT but is different (since you can't use that with reasoning models) built around the DeepSeek class of models?
1 comments
abc-1
442 days ago
I just want to fine tune deepseek v3 chat but it’s not possible or easy for regular consumers
link
hannesfur
441 days ago
Fireworks has you covered now:
https://fireworks.ai/blog/fine-tuning-deepseek-models
:)
link