Hacker News new | ask | show | jobs
by bitL 1180 days ago
Only for images. People want to generate videos next and those models will be likely GPT-sized.
1 comments

There is a video model making the rounds on /r/stablediffusion and it is just a tiny bit larger than Stable Diffusion.
You're not kidding! it's far from perfect, but pretty funny still...

https://www.reddit.com/r/StableDiffusion/comments/126xsxu/ni...

Too bad SD learned the Shutterstock watermark so well, lol

It's cool though not very stable in details over temporal axis.
Of course the quality is horrible relative to a proper video, it just illustrates that txt2vid might not need 100B+ parameters.