Deep Learning, Ian Goodfellow, Yoshua Bengio, and Aaron Courville, 2016 (MIT Press) - An authoritative textbook with a detailed discussion of Batch Normalization, including the distinction between training and inference time operations.
torch.nn.BatchNorm2d, PyTorch Developers, 2024 - Official PyTorch documentation for the 2D Batch Normalization layer, detailing the use of running_mean, running_var, and track_running_stats for training and evaluation modes.