Auto-Encoding Variational Bayes, Diederik P. Kingma and Max Welling, 2013 (International Conference on Learning Representations, ICLR), DOI: 10.48550/arXiv.1312.6114 - The foundational paper introducing Variational Autoencoders (VAEs) and the reparameterization trick, making VAEs trainable with gradient-based methods.
Deep Learning, Ian Goodfellow, Yoshua Bengio, and Aaron Courville, 2016 (MIT Press) - Chapter 20 provides a comprehensive theoretical background on Variational Autoencoders, with a clear explanation of the reparameterization trick.