by bhasinanant
111 days ago
Gemini 3.1 Pro isn't necessarily the next frontier coding model, but it is still an amazing model.
Everyone has taken note of its crazy good SVG skills, but I wanted to test this further. I needed to create a launch video for a project, and my idea was to have a story running in the background of the content, as a one-shot video of a person going about the daily office grind.
When Gemini 3.1 Pro came out, I decided to try to make an animated version of it first. So, the stack for this process looks like:
1. AI Studio Build. Very quick, great scaffolding. Already aware of React Motion and other related helpful libraries.
2. Gemini 2.5 TTS
3. OpenShot to compile the video and the audio together.

Manual work needed:
1. Getting the script together. Actually also done using Gemini in the Gemini Web App. Recorded a walkthrough of my application, and just asked it to go from there.
2. Adjusting the time allotted to each of the scenes. AI Studio created the scenes pretty well, but they followed the time durations in the ideal script. These needed to be modified to sync with the generated audio. Needed just one prompt though.
3. Recording the Animation. Go Full Screen. OBS Studio Recording the window. Animation runs on autoplay, and hides the player controls. Move mouse to another screen. Record and Reload page to start from 0:00.
4. Compiling the shot and audio together. New Project in OpenShot. Bring resources in. Clip Video to start and end at the correct spots. Audio is already synchronized. Export.

Total money spent: $0

Additional Insight: I initially forgot to switch the model, and got a Gemini 3.0 generation instead. Needless to say, that chat was a lot more cumbersome and the result a lot less appealing.

I am so happy with the result that it just seems obvious to build a wrapper that does all of the steps from recording to editing for you, but would love to hear what everybody else thinks.
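
For anyone curious what the first pieces of such a wrapper might look like, here is a rough sketch in Python. It is not what I actually ran: it assumes the narration is saved as one WAV file per scene, that `ffmpeg` is installed, and all function and file names (`scene_timeline`, `mux_command`, `recording.mkv`, and so on) are made up for illustration. The first half covers manual step 2 (deriving scene timings from the real audio lengths), the second half covers step 4 (trimming the recording and laying the audio over it).

```python
import wave


def wav_duration_seconds(path):
    """Length of a WAV file in seconds, read from its header."""
    with wave.open(str(path), "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def scene_timeline(durations, padding=0.0):
    """Turn per-scene audio lengths into (start, end) times in seconds.

    `padding` is an optional pause added after each scene (an assumption,
    not something the original workflow used).
    """
    timeline = []
    cursor = 0.0
    for duration in durations:
        end = cursor + duration + padding
        timeline.append((round(cursor, 3), round(end, 3)))
        cursor = end
    return timeline


def mux_command(video, audio, output, trim_start=0.0):
    """Build an ffmpeg argument list that trims the recording and adds audio.

    The video is re-encoded with ffmpeg's default encoder so the trim point
    is frame-accurate rather than snapping to a keyframe.
    """
    return [
        "ffmpeg",
        "-ss", f"{trim_start:.3f}",  # skip what was recorded before 0:00
        "-i", str(video),
        "-i", str(audio),
        "-map", "0:v:0",             # picture from the screen recording
        "-map", "1:a:0",             # sound from the narration file
        "-shortest",                 # stop when the shorter input ends
        str(output),
    ]


if __name__ == "__main__":
    # Hypothetical scene lengths; in practice these would come from
    # wav_duration_seconds() on each narration clip.
    print(scene_timeline([2.0, 3.5, 1.25]))
    print(" ".join(mux_command("recording.mkv", "narration.wav", "launch.mp4", 1.5)))
```

The timeline could be pasted into the single prompt that re-times the scenes, and the argument list could be handed to `subprocess.run` in place of the OpenShot step. The recording itself (OBS, full screen, reload to 0:00) is left out here, since automating it would need its own tooling.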