by corysama 62 days ago

Technically, there was one early experiment that tricked Stable Diffusion into generating spectrograms that could be converted into audio. And it worked surprisingly well.

https://web.archive.org/web/20230314190913/https://www.riffu...

https://huggingface.co/riffusion/riffusion-model-v1

But I'd expect everything from the past 3 years to diffuse the audio waveform directly.

1 comment

Kye 62 days ago

That's probably what I was thinking of. I haven't kept up as much on non-text generative AI.
