by haolu7
1361 days ago
AITemplate-PyTorch Stable Diffusion is the fastest Stable Diffusion inference solution, pushing image generation below one second on A100 for the first time (batch 1: 0.7s / 25 steps, 1.3s / 50 steps; batch 3: 1.6s / 25 steps, 0.55s per image; batch 16: 7.9s / 25 steps, 0.49s per image). That is 2.57X faster than Keras' XLA-based GPU compilation solution. More benchmark numbers and repro at:
https://github.com/facebookincubator/AITemplate/tree/main/ex...
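
For anyone who wants to compare the batch sizes directly, here is a rough sketch of the arithmetic behind the per-image figures. It only uses the rounded 25-step totals quoted above, so its results can differ slightly from the quoted per-image numbers, which presumably come from unrounded measurements. The function and variable names are made up for illustration and are not part of AITemplate.

```python
# Rounded 25-step timings on A100 as quoted in the comment:
# batch size -> total seconds for the whole batch.
timings_25_steps = {1: 0.7, 3: 1.6, 16: 7.9}


def per_image_seconds(batch_size, total_seconds):
    """Average latency per image when a batch finishes in total_seconds."""
    return total_seconds / batch_size


def images_per_second(batch_size, total_seconds):
    """Throughput for the same batch, in images per second."""
    return batch_size / total_seconds


if __name__ == "__main__":
    for batch_size, total in timings_25_steps.items():
        latency = per_image_seconds(batch_size, total)
        throughput = images_per_second(batch_size, total)
        print(f"batch {batch_size}: {latency:.2f} s/image, {throughput:.2f} images/s")
```

Larger batches amortize fixed per-step overhead, which is why the per-image time drops as the batch size grows even though the total time goes up.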
One or two more optimizations and we're gonna have live-update results.