Y
Hacker News
new
|
ask
|
show
|
jobs
by
BoredPositron
319 days ago
They also released an inference server for their models. Wan and qwen-image can be split without problems.
https://github.com/modelscope/DiffSynth-Engine
1 comments
Auracle
317 days ago
Unless I missed something just from skimming their tutorial it looks like they can do parallelism to speed things up with some models, not actually split the model (apart from the usual chunk offloading techniques).
link