Long Short-Term Memory, Sepp Hochreiter, Jürgen Schmidhuber, 1997Neural Computation, Vol. 9 (MIT Press)DOI: 10.1162/neco.1997.9.8.1735 - The foundational paper introducing the Long Short-Term Memory (LSTM) network architecture, detailing its core components and addressing the vanishing gradient problem.
Deep Learning, Ian Goodfellow, Yoshua Bengio, and Aaron Courville, 2016 (MIT Press) - A comprehensive textbook covering the theoretical foundations and practical aspects of deep learning, including a detailed explanation of recurrent neural networks and LSTMs.
Recurrent Neural Networks and LSTMs, Stanford University CS224N Course Staff, 2019 - Detailed lecture notes from a leading university course, providing an accessible yet rigorous explanation of RNNs and LSTMs, including their architecture and functioning.