|
|
|
|
|
by erwannmillon
949 days ago
|
|
LCMs are actually exactly SD architecture. LCM is initialized from a regular SD unet and finetuned on a new objective. We are already compiling to get to these times. A lot of other people getting sub-100ms times are using fewer inference steps than we do, at a quality tradeoff. |
|
Hmm, well if you mean torch.compile, y'all should still check out stable-fast, which is claiming ~16ms/iter on a 4090, twice that of torch.compile:
https://github.com/chengzeyi/stable-fast#rtx-4090-512x512-ba...