| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by ilovefood 625 days ago
	Looks pretty cool, congrats so far! Do you allow downloading the fine tuned model for local inference?

1 comments

felix089 625 days ago

Thank you, and yes that is possible. Which model are you looking to fine-tune?

link

ilovefood 625 days ago

If that's the case then I'll try the platform out :) I want to finetune Codestral or Qwen2.5-coder on a custom codebase. Thank you for the response! Are there some docs or infos about the compatibility of the downloaded models, meaning will they work right away with llama.cpp?

link

farouqaldori 625 days ago

We don't support Codestral or Qwen2.5-coder right out of the box for now, but depending on your use-case we certainly could add it.

We utilize LoRA for smaller models, and qLoRA (quantized) for 70b+ models to improve training speeds, so when downloading model weights, what you get is the weights & adapter_config.json. Should work with llama.cpp!

link