| HN Mirror



	by hannesfur 487 days ago
	You mean fine-tuning that feels like SFT but is different (since you can't use SFT with reasoning models), built around the DeepSeek class of models?

1 comment

I just want to fine-tune DeepSeek V3 chat, but that isn’t possible or easy for regular consumers.