Hacker News new | ask | show | jobs
by niwrad 1165 days ago
I feel that Midjourney v5 really lets you explore different worlds.

One recent feature the guide missed is the permutation and repeat features [1]. They're quite helpful for power users that want to explore multiple styles quickly.

Last week I tried putting together a short film using GPT-4 and Midjourney v5. I was stunned by the cinematic frames Midjourney v5 was able to create:

https://youtu.be/6O_tOuUcG9s

I (human) wrote the prompts for Midjourney, though.

[1] https://docs.midjourney.com/docs/permutations

5 comments

Damn. It's no Harry Potter by Balenciaga, but it's surprisingly compelling given that most of it was generated (granted, with prompting) by AI tools. I notice you credited GPT-4, Midjourney, and Metavoice, but was the music AI generated as well?

I've gotta say, I've seen worse storytelling and cinematography come from actual, serious humans who were getting paid to do it. And the Balenciaga thing is obviously a joke, because that whole genre only works because the style of those videos is beyond parody and sails right over the uncanny valley. This is different. This is interesting. I like it.

Thanks for your kind comments!

I'm glad you pointed out the music. The music was also AI generated with a tool called AIVA [1]. I'd never composed a piece of music before, and I was pretty surprised by what I could "create". I spent 30~60 minutes max creating the score.

Some parts of their product still feel janky, but as an overall concept, it's quite fascinating. One of the interactions I enjoyed was that AIVA creates scores with different tracks (layers). So I was able to edit tracks I don't like (e.g., change a Piano track to Brass) or have AIVA completely regenerate certain sections of the score (e.g., redo the bridge, regenerate the chorus sections).

One difference from Midjourney is that there's no text-based prompting. Instead, you "prompt" through music inspiration.

[1] https://www.aiva.ai/

Ah, cool!

So, basically, if I want it to compose Baroque music, I can give it Vivaldi, Bach, and maybe a little early classical, tell it to go to work, and end up with something that sounds like it came out in 1765?

I wonder what the limitations are on that whole "musical prompting" deal.

Your video was better in my opinion because it has a real story. All the Balenciaga videos out there are really just realistically rendered parodies with little or no emotion to them.

Belenciaga piece is hilarious.
It is, really. There are a bunch of imitators out there now, but they're less funny to me. I don't think it's because they're generally less well done. While most of them that I've seen are less well done, in the sense that the voices are more "computer-y" sounding, and the rendering not as good, I think it's because the original really is a parody, and doing a parody of a parody just starts getting more ridiculous without getting funnier.
Really nice video! The person's hand with 2 thumbs at 0:47 can really tell it was created using Midjourney. It usually does fingers very wrong! lol
nightmare fuel indeed
Impressive. Thanks for sharing
Curious, did all your work necessitate subscribing to the $30 plan?
I ended up subscribing to the $60 plan, mainly to get access to the Stealth Mode. I used about ~4h of fast time during the project. With that said, I could have created this with the $10 plan (3.3 hours) if I had to.
Why do you need stealth mode?
wow I had no idea about permutations in Midjourney and it's an amazing feature! thank you very much!!