Hacker News new | ask | show | jobs
by globalvisualmem 1386 days ago
There is also recent work by Google called DreamBooth, though similar to Imagen/Parti Google refuses to release any model or code.

https://dreambooth.github.io/

2 comments

Sounds like there is a chance might open source a version for Stable Diffusion. Let's see though.

From Twitter : >Awesome job! That really extends the applicability of powerful generative models nowadays. Could I ask if you have any timetable for releasing the code please? >We are working on plans for implementation on other open source models

https://twitter.com/jason_dingzc/status/1563578510958297089

Yeah, they allude to supporting more than one token for the identifier, which would be nice
Textual inversion also supports more than one embedding for identifier, just change num_vectors_per_token in the yaml config. Example: https://www.reddit.com/r/StableDiffusion/comments/wzf1qk/sd_...
so is dreambooth worth open-sourcing then given textual inversion?
From the textual inversion guy's own comment on Twitter

>The objective is similar, but it's: (1) A different approach - they also fine tune the model itself, and they get much much better identity preservation!

dreambooth retains higher fidelity as the model is finetuned, but to be honest I think textual inversion is actually more applicable as you can just add some embeddings to inject new knowledge into the model and not an whole new model just for a single concept (if you want to share it with others). Also I have not seen dreambooth being applied to replicate styles.