Hacker News
by minimaxir, 264 days ago
It's very hard to generate good audio simultaneously with video (and simultaneous generation is necessary to maintain lip sync). Veo 3 et al. also have flat, mono-channel audio, but it's not as bad as in these Sora 2 demos.