Reinforcement Learning: An Introduction (2nd edition), Richard S. Sutton and Andrew G. Barto, 2018 (MIT Press) - The standard reinforcement learning textbook, with detailed coverage of TD(0), SARSA, and Q-learning.
On-Line Q-Learning Using Connectionist Systems, Gavin Adrian Rummery, Mahesan Niranjan, 1994 (Department of Engineering, University of Cambridge) - The technical report that introduced the on-policy temporal-difference control algorithm later named SARSA; the authors themselves called it Modified Connectionist Q-Learning.