Hacker News new | ask | show | jobs
by acaciabengo 112 days ago
This is great and exciting. I happened to be doing some research to build memory-efficient diffusion models. I have not yet built the demo, but looking at a mix of architecture from several papers, IMTalker, SageAttension, FlashVSR, and Sparse VideoGen, with the intention to reduce memory to about 8GB.

The plan was to swap FlashAttention out, and also for an audio driver; SVG could have improved. At 60FPS, I think you are already doing this.

Great work.