Y
Hacker News
new
|
ask
|
show
|
jobs
by
abc-1
442 days ago
Neat. Maybe you guys should make a fine tuning platform for deepseek specifically, with a fine tune API similar to openAIs. You could expand out into hosting those models too.
1 comments
hannesfur
442 days ago
you mean fine tuning that feels like SFT but is different (since you can't use that with reasoning models) built around the DeepSeek class of models?
link
abc-1
442 days ago
I just want to fine tune deepseek v3 chat but it’s not possible or easy for regular consumers
link
hannesfur
441 days ago
Fireworks has you covered now:
https://fireworks.ai/blog/fine-tuning-deepseek-models
:)
link