Long Short-Term Memory, Sepp Hochreiter and Jürgen Schmidhuber, 1997, Neural Computation, Vol. 9 (MIT Press), DOI: 10.1162/neco.1997.9.8.1735 - Original research paper introducing the Long Short-Term Memory (LSTM) architecture, detailing its cell state and gating mechanisms designed to overcome vanishing gradients in RNNs.
Deep Learning, Ian Goodfellow, Yoshua Bengio, and Aaron Courville, 2016 (MIT Press) - A comprehensive textbook covering the theoretical foundations and practical aspects of deep learning, with dedicated sections on recurrent neural networks and LSTMs.
torch.nn.LSTM, PyTorch Authors, 2024 - Official documentation for PyTorch's LSTM module, providing API details, parameters, and usage examples for practical implementation in deep learning projects.