|
|
|
|
|
by bayindirh
2066 days ago
|
|
Since the losses in the analog conversion process cannot be determined exactly, the model is bound to add some noise to the converted audio. Video has more spatial data to guess the color and motion so it's easier in practice. The unconverted sound may be crisper and has more details but, there's no guarantee that they're the original details so, it won't be the original recording itself. |
|
Perhaps you have to play the media a few dozen times and do the media equivalent of frame stacking to see through the noise.
It's also quite possible no ML would be needed. I don't think frame stacking uses ML.
I wouldn't be surprised if you could play a song on repeat from the other side of your house and extract a very good copy of it, so long as you knew exactly when the song began and looped. You might only need to know the length of the song, not even when it began.
It might not be practical, but it would be a cool blog post.