Deep Learning, Ian Goodfellow, Yoshua Bengio, and Aaron Courville, 2016 (MIT Press) - This foundational textbook discusses data preprocessing, including normalization and data representation, in the context of deep learning models.
Neural Networks and Deep Learning, Michael A. Nielsen, 2015 (Determination Press) - An online book explaining the mathematical and intuitive reasons behind data scaling and its impact on gradient descent convergence in neural networks.