What's interesting to me is that the project feels very "un-Apple", despite being open-sourced under the Apple org: some typos and a lack of proper punctuation in the README, Jupyter notebooks for the data processing instead of scripts or a CLI, poor repo organization, and no comments even in the demo: https://github.com/apple/ml-mgie/blob/main/demo.ipynb
Apple truly becoming an ML company when they release ML Engineer quality code ;)
I came up with a similar idea (also pre-DALL-E edits-via-instruction), on the premise that prompting generators kinda sucks (chat interfaces for image editing aren't great either) and really you just want to explore the latent space "around" an initial prompt.
I wish they had more examples. The image doesn't seem to be that much better than what you'd get by generating an image with Stable Diffusion and then tweaking the prompt.
> Notices: Apple's rights in the attached weight differentials are hereby licensed under the CC-BY-NC license. Apple makes no representations with regards to LLaMa or any other third party software, which are subject to their own terms.
Wait, they can do that? Assuming weights have copyright, shouldn't the finetuning be a modification of the original work and so have the same license?