Deep Learning, Ian Goodfellow, Yoshua Bengio, and Aaron Courville, 2016 (MIT Press) - A foundational textbook covering the theory and practice of deep learning, including comprehensive discussions on overfitting, generalization, and various regularization techniques.
The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Trevor Hastie, Robert Tibshirani, and Jerome Friedman, 2009 (Springer) - A classic reference for statistical learning, providing a detailed foundation on concepts like Ridge and Lasso regression, which are the basis for L2 and L1 weight regularization.