by julespitt 3809 days ago
Went looking for audio samples; here are some from one of the researchers:

http://www.zhizheng.org/demo/is15_mte/demo.html
http://www.zhizheng.org/demo/dnn_tts/demo.html

1 comments

I thought this would be about text-to-speech applications, but this seems more like an encoder-decoder problem (make the network learn a pattern and then let it reproduce it). I'm wondering how long it will be until we see working TTS based on LSTM RNNs.
Yeah, can someone explain the exact problem of "statistical parametric speech synthesis"? I can't find a general overview of the problem itself.
I'm a newbie to all this, but I can imagine it could be useful for speech compression.