FastSpeech 2: Fast and High-Quality End-to-End Text to Speech, Yi Ren, Chenxu Hu, Xu Tan, Tao Qin, Sheng Zhao, Zhou Zhao, Tie-Yan Liu, 2021. International Conference on Learning Representations (ICLR). DOI: 10.48550/arXiv.2006.04558 - Improves upon FastSpeech by eliminating the teacher model dependency and introducing a variance adaptor for better prosody control (pitch, energy), further enhancing quality and robustness.