by kantthpel 821 days ago
Thank you so much! The biggest issue with EnCodec (especially the 48 kHz version) is that it depends heavily on normalization. That wasn't a problem for their use case (music), since music generally doesn't contain silent portions, but it is for samples. Many one-shots and loops contain long stretches of silence or very quiet audio, which become essentially pure noise once normalized. Training our custom autoencoder to handle this was one of the key factors that enabled us to get such good audio quality.
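To make the failure mode concrete, here is a rough sketch of what happens when a near-silent segment is normalized. It assumes the normalization is a per-segment RMS scaling; `rms_normalize` is a hypothetical helper written for this illustration, not EnCodec's actual API, and the signal levels are made up.

```python
import numpy as np

def rms_normalize(x, eps=1e-8):
    """Scale a segment to roughly unit RMS (hypothetical helper, not EnCodec's API)."""
    scale = np.sqrt(np.mean(x ** 2)) + eps
    return x / scale, scale

rng = np.random.default_rng(0)
sr = 48_000
t = np.arange(sr) / sr

# One second of a loud 440 Hz tone, standing in for music.
loud = 0.5 * np.sin(2 * np.pi * 440 * t)
# One second of "silence": only a faint noise floor, as in the tail of a one-shot.
quiet = 1e-5 * rng.standard_normal(sr)

loud_norm, loud_scale = rms_normalize(loud)
quiet_norm, quiet_scale = rms_normalize(quiet)

# Gain applied to the quiet segment, in dB. The noise floor is boosted
# to the same level as the normalized tone.
gain_db = 20 * np.log10(1.0 / quiet_scale)
print(f"gain applied to quiet segment: {gain_db:.1f} dB")
```

After normalization both segments have about the same RMS, so the model is handed full-scale noise wherever the original was nearly silent.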