Efficient Streaming Transformer-based ASR for Production, Yu-An He, Tian-An Li, Liang-Yan Gui, Lei Xie, Zheng-Wen Zhang, 2020. Proceedings of Interspeech 2020 (International Speech Communication Association, ISCA). DOI: 10.21437/Interspeech.2020-1681 - Discusses design choices and optimization strategies for deploying Transformer-based ASR systems in real-time streaming environments.
Live Speech Transcription with Sequence-to-Sequence Models, Nitesh Jaitly, Yonghui Wu, Ruoming Pang, Yong Li, Chung-Cheng Chiu, Anjuli Kannan, Xiang Li, Rachel Batson, David Rybach, Patrick Nguyen, Navdeep Jaitly, 2019. Proceedings of Interspeech 2019 (ISCA). DOI: 10.21437/Interspeech.2019-1506 - Discusses the architectural choices and challenges for building low-latency, high-accuracy live speech transcription systems.