|
|
|
|
|
by pclmulqdq
1375 days ago
|
|
I recently went to stable diffusion for some art for a D&D campaign guide, to make the thing more immersive. While the pictures are impressive, there are a lot of things about the generated art that just don't make sense: In one picture, a tower had a staircase down 1/3rd of the way from the door to the ground, just stopping at that point. Most had issues like this. Several other pictures I wanted were impossible to generate. The field of "art that needs human communication skills" seems to be a lot broader than just scientific illustration. |
|
SD obviously doesn't understand language in the same way we do, so it can be tricky to describe things in a way that will match your expectations. Once you start to understand the tricks here, it gets easier and easier.
Inpainting will let you fix a lot of the rest. Staircase stops? Select the area where it stopped, get the AI to generate more. People are already doing this to create very complex artwork where there are issues with faces, hands, etc. https://www.reddit.com/r/StableDiffusion/comments/x9u8qh/img... is a great example of how you can quickly iterate over a scene.
One of the other things people struggle with is consistent characters and settings, but people have found ways to improve this with Midjourney - https://docs.google.com/document/u/1/d/e/2PACX-1vRahIr3-h_V3...
There's more of a learning curve to these tools than most people think, but it's also still miles and miles away from the learning curve required to actually be proficient at the technical aspects of making art.