| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by bitL 1180 days ago
	Only for images. People want to generate videos next and those models will be likely GPT-sized.

1 comments

There is a video model making the rounds on /r/stablediffusion and it is just a tiny bit larger than Stable Diffusion.

You're not kidding! it's far from perfect, but pretty funny still...

Too bad SD learned the Shutterstock watermark so well, lol

It's cool though not very stable in details over temporal axis.

Of course the quality is horrible relative to a proper video, it just illustrates that txt2vid might not need 100B+ parameters.