Inspecting Gradients for Issues (Vanishing/Exploding)
Was this section helpful?
Deep Learning, Ian Goodfellow, Yoshua Bengio, and Aaron Courville, 2016 (MIT Press) - A comprehensive textbook that provides a thorough theoretical background on deep learning, including detailed explanations of backpropagation, vanishing/exploding gradients, and various mitigation strategies.
Autograd mechanics, PyTorch Contributors, 2023 (PyTorch) - Official PyTorch documentation detailing how gradients are computed, stored, and accessed using the Autograd system, which is essential for detecting and debugging gradient issues.
Deep Residual Learning for Image Recognition, Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun, 2016Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (IEEE)DOI: 10.1109/CVPR.2016.90 - Presents Residual Networks (ResNets) with skip connections, a foundational architecture that effectively addresses the vanishing gradient problem in very deep networks by providing alternative gradient paths.