Numerical Optimization, Jorge Nocedal and Stephen J. Wright, 2006 (Springer)DOI: 10.1007/978-0-387-40065-5 - A standard text for optimization algorithms, providing detailed explanations of methods like Newton's method that utilize second-order derivatives and Hessian-vector products.
Deep Learning, Ian Goodfellow, Yoshua Bengio, and Aaron Courville, 2016 (MIT Press) - Provides a comprehensive treatment of automatic differentiation and its application in deep learning, including discussions on second-order optimization techniques.