Long Short-Term Memory, Sepp Hochreiter, Jürgen Schmidhuber, 1997Neural Computation, Vol. 9 (MIT Press)DOI: 10.1162/neco.1997.9.8.1735 - The original paper introducing the Long Short-Term Memory (LSTM) architecture, laying the groundwork for its internal mechanisms.
Deep Learning, Ian Goodfellow, Yoshua Bengio, and Aaron Courville, 2016 (MIT Press) - A comprehensive textbook with a dedicated chapter on recurrent neural networks, including detailed explanations of LSTMs and their theoretical underpinnings.
tf.keras.layers.LSTM, TensorFlow Developers, 2024 - Official documentation for the Keras LSTM layer in TensorFlow, covering its constructor parameters, input/output shapes, and usage examples.
torch.nn.LSTM, PyTorch Developers, 2024 (PyTorch Foundation) - Official documentation for PyTorch's LSTM module, providing specifications for its parameters, states, and data flow.