I am one of the authors on this paper and am happy to answer any questions you may have.
[0] https://arxiv.org/abs/1702.07825
[1] http://research.baidu.com/deep-voice-production-quality-text...