Layer Normalization, Jimmy Lei Ba, Jamie Ryan Kiros, Geoffrey E. Hinton, 2016arXiv preprint arXiv:1607.06450DOI: 10.48550/arXiv.1607.06450 - Presents Layer Normalization as an alternative to Batch Normalization, particularly effective for recurrent neural networks and small batch sizes.
Group Normalization, Yuxin Wu, Kaiming He, 2018European Conference on Computer Vision (ECCV), Vol. 11211DOI: 10.48550/arXiv.1803.08494 - Introduces Group Normalization, which is robust to varying batch sizes and offers performance comparable to Batch Normalization in many cases.
Deep Learning, Ian Goodfellow, Yoshua Bengio, Aaron Courville, 2016 (MIT Press) - A foundational textbook providing comprehensive coverage of deep learning topics, including normalization techniques and best practices.
torch.nn.BatchNorm2d, PyTorch Documentation, 2024 - Official documentation explaining the usage, parameters, and behavior of BatchNorm2d in the PyTorch framework.