by JimDabell
245 days ago
Yeah, I brought that up here and got some interesting responses:

> It would be interesting to see how far you could get using deepfakes as a method for video call compression.

> Train a model locally ahead of time and upload it to a server, then whenever you have a call scheduled the model is downloaded in advance by the other participants.

> Now, instead of having to send video data, you only have to send a representation of the facial movements so that the recipients can render it on their end. When the tech is a little further along, it should be possible to get good quality video using only a fraction of the bandwidth.

https://news.ycombinator.com/item?id=22907718

Specifically for voice, this was mentioned:

> A Real-Time Wideband Neural Vocoder at 1.6 Kb/S Using LPCNet

https://news.ycombinator.com/item?id=19520194
You could probably also transmit a low-res grayscale version of the video to “map” any local reproduction onto. Kinda like how an artist could reasonably reproduce a face from a low-resolution image if they knew who the subject was.
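
To put rough numbers on the two ideas in this thread (sending only facial-movement data, and sending a low-res grayscale guide frame), here is a back-of-the-envelope sketch. Everything in it is an assumption for illustration: 68 keypoints per frame, 16-bit coordinates, 30 fps, a 1280x720 source, and the helper names themselves. All sizes are uncompressed, so the comparison only shows orders of magnitude, not what a real codec would achieve.

```python
import struct

# Hypothetical parameters, chosen only for illustration.
NUM_LANDMARKS = 68          # assumed number of 2D facial keypoints per frame
FPS = 30                    # assumed frame rate
WIDTH, HEIGHT = 1280, 720   # assumed source resolution


def pack_landmarks(points):
    """Serialize (x, y) keypoints as little-endian unsigned 16-bit integers."""
    flat = [coord for point in points for coord in point]
    return struct.pack(f"<{len(flat)}H", *flat)


def to_low_res_gray(frame, factor):
    """Downsample an RGB frame (rows of (r, g, b) tuples) by block averaging
    and convert it to 8-bit grayscale using the Rec. 601 luma weights."""
    height, width = len(frame), len(frame[0])
    out = []
    for y in range(0, height - factor + 1, factor):
        row = []
        for x in range(0, width - factor + 1, factor):
            total = 0.0
            for dy in range(factor):
                for dx in range(factor):
                    r, g, b = frame[y + dy][x + dx]
                    total += 0.299 * r + 0.587 * g + 0.114 * b
            row.append(round(total / (factor * factor)))
        out.append(row)
    return out


def kbit_per_second(bytes_per_frame, fps=FPS):
    """Uncompressed bitrate for a fixed per-frame payload size."""
    return bytes_per_frame * 8 * fps / 1000


if __name__ == "__main__":
    landmark_bytes = len(pack_landmarks([(0, 0)] * NUM_LANDMARKS))
    raw_rgb_bytes = WIDTH * HEIGHT * 3
    guide_bytes = (WIDTH // 16) * (HEIGHT // 16)  # 1 byte per gray pixel
    for label, size in [("landmarks", landmark_bytes),
                        ("16x-downsampled gray guide", guide_bytes),
                        ("raw RGB", raw_rgb_bytes)]:
        print(f"{label}: {size} bytes/frame, {kbit_per_second(size):.2f} kbit/s")
```

Under these assumptions the keypoint payload is a few hundred bytes per frame and the grayscale guide a few kilobytes, both tiny next to a raw frame. The rendering model on the receiving end, which is the hard part, is not sketched here.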