I found the Obama video at the end very interesting. It would be a neat next step to map non-Obama audio to the generated video. For example, pull audio from an Obama impersonator.
We can impersonate voices with neural nets. We can clone timbre and style, and this tech is being used commercially by Baidu at the very least (keyword: Deep Voice 3).
We can impersonate voices with neural nets. We can clone timbre and style, and this tech is being used commercially by Baidu at the very least (keyword: Deep Voice 3).