Hacker News new | ask | show | jobs
Bagel • Unified Model for Multimodal Understanding and Generation (github.com)
7 points by montyanderson 400 days ago
2 comments

Curious there is no discussion on this. I think it looks interesting.

If nothing else, I'm glad someone is still working on open-weight image models. AFAIK there hasn't been much movement in the area since Flux.

Was looking at the model and was curious about HN comments, thought this would be a good talking piece since it has been released open, haven’t tried to run it locally yet but will do soon as I can.
There has been some discussion in /r/stablediffusion I'm not sure if anyone tried to run it though.
It's the luck of the submission time window; currently: https://news.ycombinator.com/item?id=44094362
The model itself appears to be around 30gb, my rule of thumb double it for ram. So should run on 60gb vram/unified ram ?