Understanding the difficulty of training deep feedforward neural networks, Xavier Glorot, Yoshua Bengio, 2010Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, Vol. 9 (PMLR) - The original academic paper proposing Xavier initialization (also known as Glorot initialization) for neural networks.
Deep Learning, Ian Goodfellow, Yoshua Bengio, Aaron Courville, 2016 (MIT Press) - A comprehensive textbook with a dedicated section explaining various parameter initialization strategies and their theoretical foundations.
torch.nn.init - PyTorch documentation, PyTorch Team, 2022 (PyTorch Foundation) - Official PyTorch documentation detailing built-in functions for initializing neural network weights, including Xavier and Kaiming (He) methods.