Reinforcement Learning: An Introduction (second edition), Richard S. Sutton and Andrew G. Barto, 2018 (MIT Press) - Comprehensive textbook covering the theoretical foundations and algorithms of reinforcement learning, including a detailed explanation and derivation of the Policy Gradient Theorem.