|
|
|
|
|
by danielbln
389 days ago
|
|
You are underselling it because you make it sound like all the model adds is some foley, when in fact it adds facial animations that are in line with the dialogue spoken. Go ahead and create a Kling render that I only need to add VO to, you can't because Kling doesn't do that. You need a Omnihuman level model (or veo3) for that and it makes all the difference. Happy to agree to disagree, but imo this absolutely is a step change. |
|
I don't need to create anything for you. Go visit r/aivideo and go look at the Kling or even the Hailuo Minimax (admittedly worse in fidelity) attempts. Some of them have been made to even sing or do podcasts. Again. They've been there for at least 6-10 months ago, this happens to generate it as one output. It's not nothing, but this really exposes a lot of the people who aren't familiar with this space when they keep overestimating things they've probably seen a months ago. Somewhat accurate expressions? Passable lipsyncing? All there. Even with the weaker models like Runway and Hailuo.
Again. Use the products. You'll know. Hobbyists have been on it for quite sometime already. Also. I didn't say they were just adding foley, though I can argue the quality of the sound they're adding, that's not my point. My point is, is that everytime something like this comes out there's always people ready to speak on "what industries such thing can destroy right now" before using the thing. It's borderline deranged.