For those of us interested in this area of research what are the best papers and other resources for us to read? Has there been any success with deep approaches that do not have the WaveNet architecture?
Check out Char2Wav (recent) and SampleRNN (the RNN-based audio synthesis architecture). The related work section of the Deep Voice paper mention a bunch of related papers that are relevant!