| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by Auracle 359 days ago
	You can’t split image models over 2 GPUs like you can LLMs.

1 comments

BoredPositron 359 days ago

They also released an inference server for their models. Wan and qwen-image can be split without problems. https://github.com/modelscope/DiffSynth-Engine

link

Auracle 357 days ago

Unless I missed something just from skimming their tutorial it looks like they can do parallelism to speed things up with some models, not actually split the model (apart from the usual chunk offloading techniques).

link