Hacker News new | ask | show | jobs
LatentSync1.6, an end-to-end lip-sync method (latentsync.com)
3 points by BruceWok 167 days ago
1 comments

LatentSync is a cutting-edge, open-source lip-synchronization framework powered by Audio-Conditioned Latent Diffusion Models. By integrating Whisper audio embeddings with advanced temporal alignment (TREPA), it transforms arbitrary audio and video inputs into photorealistic, high-resolution (512x512) talking head videos.