by jasonhanley
510 days ago
I agree the multimodal stuff is amazing. I'm seriously impressed with the new Gemini 2.0 family of models and can't wait until the full multimodal capabilities are in general release.

In terms of the HeyGen vid, it's passable, but that was something I literally whipped up in 10 minutes. You can make ones that are much, much better if you invest in creating better training material. The voice and video model in this case only used the one 3-minute source video.

Funny you mention the "people zoo" thing. That's actually part of a sci-fi story I've been trying to write since I was in my teens. Roughed out here: https://youtu.be/2KLdaVs_ugw