Layer Normalization, Jimmy Lei Ba, Jamie Ryan Kiros, and Geoffrey E. Hinton, 2016Advances in Neural Information Processing Systems (NeurIPS 2016)DOI: 10.48550/arXiv.1607.06450 - The original research paper introducing Layer Normalization and detailing its architecture and benefits, particularly for recurrent neural networks.
Deep Learning, Ian Goodfellow, Yoshua Bengio, and Aaron Courville, 2016 (MIT Press) - A comprehensive textbook that includes detailed discussions on various normalization techniques, including Batch and Layer Normalization, within the broader scope of deep learning optimization.