Hacker News new | ask | show | jobs
by r--man 1350 days ago
I disagree. It's a rudimentary features of all these models to take a concept picture and refine it. It won't be like the director would give a prompt and get a feature length movie, it will be more like the director uses MS Paint (as in a common software for non tech people) to make a scene outline and directs AI to make a stylish and animated version of that. Something is wrong? just erase it and try again. Dalle2 had this interface from the get go. The models just haven't gotten there yet.
1 comments

Try again and do what? How are you directing the shot? How do you erase an emotion? How do you erase and redo inner turmoil when delivering a performance?
You tell it, "do it all over again, now with less inner turmoil". Not joking, that's all it's going to take. There are also a few diffusion based speech generators that handle all sounds, inflections and styles, they are going to come in handy for tweaking turmoil levels.
Yep!

"Restyle that last scene, showing different mixtures of fear/concern/excitement on male lead's face. Try to evoke a little of Harrison Ford's expressions in his famous roles. Render me 20 alternate treatments."

[5 minutes later]

«Here are the 20 alternate takes you requested for ranking.»

"OK, combine take #7 up to the glance back, with #13 thereafter."

«Done.»