Hacker News new | ask | show | jobs
by joshcryer 1453 days ago
I've made perhaps overly absolutist statements like "don't you see! this kills artists jobs!" and it was shrugged off as if I was insane. I probably could've phrased it differently, but to me this is game changing in several fields. Granted, it will open up a new field of "generative artists" but, having played with these things, this is a pretty trivial job, and their training nets are only going to get better.
3 comments

I’ve had a lot of fun playing with Disco Diffusion prompts, but I agree that the people excited about “a generation of prompt artists” are a bit misguided. Soon an AI will emerge that can come up with “better” prompts than you, and the “art” of creating prompts will have a lower skill ceiling.
Like a neutral network just for making prompts that result in aesthetically pleasing Imagen images? And then maybe we can come up with a neutral net that can decide which pictures are good and which aren't. Then we can just have robots making art for the sake of consumption solely by robots.
The GPT algorithms are actually pretty good at making detailed image generation prompts if you ask it to describe in detail the general idea you want.
Do you have a link to any papers about this? Would love to check them out
No, just playing around with dall-e mini (no access yet to anything else) and beta.openai.com's text-davinci-002 model. For instance, if I ask dall-e mini for "painting with dancers":

https://i.imgur.com/flXoTgZ.png

I can ask davinci-002 "Vivid description of a painting with dancers:" and get:

The painting is of two dancers in a passionate embrace, their bodies entwined as they move together in a sensual dance. The woman's dress is flowing and reveals her curves, while the man's shirt is open, revealing his muscular chest. They are surrounded by a crowd of people who are watching them with looks of admiration and desire. The painting is full of color and movement, and the dancers seem to be in a world of their own, lost in their passion for each other.

And then pass that to dall-e mini:

https://i.imgur.com/eOIQuPF.png

dall-e mini is sadly not quite up to the challenge, but it gives the generation a lot more detail. Some other examples:

"The painting is of two dancers in the middle of a dance. They are both wearing white, and their hair is flowing around them as they move. The background is a blur of color, and the light is shining on the dancers, making them look like they are in the spotlight."

https://i.imgur.com/ldktMHO.png

"The painting is full of energy and movement, with the dancers leaping and spinning around the stage. They are all wearing brightly coloured costumes, which stand out against the dark background. The light from the stage spotlight is shining on them, making them look even more vibrant. The whole scene is full of life and excitement."

https://i.imgur.com/1KFJbzJ.png

To me, it paves the way for creative prototyping. I don't see this as a zero-sum game between artists and AI. Instead, I could see artists using this for some serious time saving, and leveraging that extra time and energy for creating better results.
It could also be used for more nefarious reasons like disinformation campaigns though... it will be interesting to see what the next few years have in store
You don't need good-looking pictures for propaganda. Old people (the main targets) believe literally anything they see on Facebook, especially if it confirms their priors aka fits their worldview, and prefer it to look bad because that's more authentic. For anyone else, the point is to make them disbelieve everything, not to believe you specifically.