Reinforcement Learning: An Introduction, Richard S. Sutton and Andrew G. Barto, 2018 (MIT Press) - A foundational textbook offering a comprehensive explanation of temporal-difference learning, including SARSA, Q-learning, and Expected SARSA, covering their properties and comparative analysis.
Reinforcement Learning Lecture 5: Model-Free Control, David Silver, 2015 - An influential lecture from a prominent researcher, providing clear explanations and comparisons of SARSA, Q-learning, and Expected SARSA within the context of model-free control.