Hacker News new | ask | show | jobs
by wwwtyro 601 days ago
With consistent representation of characters, are we now on the precipice of a Cambrian explosion of manga/graphic novels/comics?
4 comments

I sure hope so - at the very least I will use it for tabletop illustrations instead of having to describe a party's scenario result - I can give them a character-accurate image showing their success (or epic lack thereof).
It’s not really consistent - or anymore consistent than, say, SDXL with IP adapter. Even in their example images the character they’ve input comes out wearing different clothes.
I would say we already had one of those. There's more hand crafted human made content available than anyone cares to read.

While this will enable a certain degree of more spam it will more importantly, on the positive side of things, democratize the creative process to those who want to tell a story in images but lack the skill and resources to churn it out traditionally.

not yet, still can't generate transparent images
From the controlnet author:

Transparent Image Layer Diffusion using Latent Transparency

https://arxiv.org/abs/2402.17113

https://github.com/lllyasviel/sd-forge-layerdiffuse

Why do you need that? For manga specifically, generate in greyscale and convert luminance to alpha; then composite; then color.

Or, if you need solid regions that overlap and mask out other regions, then generate objects over a chroma-keyable flat background.