As a noob in Generative AI, could you go into more details about the face ipadapter and wav2lip models? How were you able to figure out that OP was using these models?
The generated people have the typical face adapter bugs. You can learn about them here [0]. wav2lip is a old gan based model, the output produces a slight green tint. The rest of the animation is just animatediff.
[0] https://www.youtube.com/watch?v=t2OBzV3UHv4
[1] https://github.com/Rudrabha/Wav2Lip