Front-end factor analysis for speaker verification, Patrick Kenny, M. Stafylakis, and P. Ouellet, 2008INTERSPEECH 2008 (International Speech Communication Association (ISCA))DOI: 10.21437/Interspeech.2008-10 - Presents the i-vector framework, a powerful low-dimensional representation of speaker characteristics, originally for speaker verification but widely adopted for ASR.
X-vectors: Robust DNN Embeddings for Speaker Recognition, David Snyder, Daniel Garcia-Romero, Gregory Sell, Daniel Povey, and Alan McCree, 2018Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2018) (IEEE)DOI: 10.1109/ICASSP.2018.8461375 - Introduces x-vectors, a prominent neural network-based speaker embedding technique used as auxiliary input for ASR adaptation.
Parameter-Efficient Transfer Learning for NLP, Neil Houlsby, Andrei Giurgiu, Stanislaw Swirszcz, Krzysztof Konecki, Alexis Coavoux, Dmitry Grishin, Jay Lemmon, and Marcin Michalski, 2021Journal of Machine Learning Research, Vol. 22DOI: 10.5555/3502209.3502213 - Presents adapter modules, a parameter-efficient method for adapting large pre-trained neural networks with minimal speaker-specific parameters.