Mathematics for Machine Learning, Marc Peter Deisenroth, A. Aldo Faisal, Cheng Soon Ong, 2020 (Cambridge University Press) - Covers foundational linear algebra concepts applied to machine learning, including linear regression and the derivation of normal equations.
An Introduction to Statistical Learning with Applications in R, Gareth James, Daniela Witten, Trevor Hastie, Robert Tibshirani, 2021 (Springer) - Introduces linear regression, its derivation, and regularized variants like Ridge regression, providing a statistical learning perspective.
Deep Learning, Ian Goodfellow, Yoshua Bengio, Aaron Courville, 2016 (MIT Press) - Explains fundamental linear algebra concepts as they apply to machine learning models and optimization.
CS229 Lecture Notes: Supervised Learning, Andrew Ng, Tengyu Ma, 2023 (Stanford University) - Provides a clear derivation and explanation of linear regression and the normal equations within a machine learning context.