Diffusers: Modular Diffusion model library from HuggingFace (github.com)
47 points by kashifr 1436 days ago
2 comments

Can someone explain this to me? I've got a basic understanding of ML, but seem out of the loop lately.
Check out this video on the math: https://www.youtube.com/watch?v=HoKDTa5jHvg. A lot of other sources skip over the math to give you the big picture. Once you have that big picture, you will find the video very helpful in understanding the math.
Holy shit, this is one of the most well-produced, succinct, educational YouTube videos I've ever seen. Instantly subscribed.
The meta tl;dr is that diffusion is what most modern AI image generation uses (with a lot of developments in that space), but the code workflows/Colab Notebooks are a mess and filled with technical debt, so a high-level Hugging Face library for it makes sense.

HF recently hired the creators of one of the most popular Colab Notebooks: https://twitter.com/multimodalart/status/1549720764588376064
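
For anyone still out of the loop on what "diffusion" means here, this is a rough sketch of just the forward (noising) half of the process, in plain Python with no dependencies. It is not code from the Diffusers library. The function names `make_alpha_bars` and `add_noise` are made up for illustration, and the linear schedule (1000 steps, beta from 1e-4 to 0.02) is an assumption taken from the common DDPM setup. A real model is then trained to undo this noise step by step, which is the part the library wraps.

```python
import math
import random


def make_alpha_bars(num_steps=1000, beta_start=1e-4, beta_end=0.02):
    """Return the cumulative products of (1 - beta_t) for a linear beta schedule.

    The schedule values are assumptions (the usual DDPM defaults), not
    something specified in this thread.
    """
    alpha_bars = []
    running = 1.0
    denom = max(num_steps - 1, 1)  # avoid dividing by zero for a 1-step schedule
    for t in range(num_steps):
        beta = beta_start + (beta_end - beta_start) * t / denom
        running *= 1.0 - beta
        alpha_bars.append(running)
    return alpha_bars


def add_noise(x0, t, alpha_bars, rng):
    """Sample x_t directly from x_0:

    x_t = sqrt(alpha_bar_t) * x_0 + sqrt(1 - alpha_bar_t) * eps, eps ~ N(0, 1)
    """
    signal_scale = math.sqrt(alpha_bars[t])
    noise_scale = math.sqrt(1.0 - alpha_bars[t])
    return [signal_scale * v + noise_scale * rng.gauss(0.0, 1.0) for v in x0]


if __name__ == "__main__":
    rng = random.Random(0)
    alpha_bars = make_alpha_bars()
    x0 = [1.0, -1.0, 0.5, 0.0]  # stand-in for pixel values
    for t in (0, 499, 999):
        # Early steps keep most of the signal; late steps are almost pure noise.
        print(t, alpha_bars[t], add_noise(x0, t, alpha_bars, rng))
```

The point of the closed form is that you can jump to any noise level `t` in one shot during training instead of simulating every intermediate step.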

A library of pretrained diffusion models across multiple modalities, such as vision and audio.