Deep Learning, Ian Goodfellow, Yoshua Bengio, and Aaron Courville, 2016 (MIT Press) - Provides a comprehensive introduction to deep learning, including dedicated sections on sequence modeling, recurrent neural networks, and their applications to various tasks like prediction and generation.
Sequence to Sequence Learning with Neural Networks, Ilya Sutskever, Oriol Vinyals, Quoc V. Le, 2014Advances in Neural Information Processing Systems, Vol. 27 (NeurIPS) - A foundational paper introducing the sequence-to-sequence model, which became a standard for tasks like machine translation and summarization, where input and output sequences have different lengths.
Neural Machine Translation by Jointly Learning to Align and Translate, Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio, 2014International Conference on Learning Representations (ICLR 2015)DOI: 10.48550/arXiv.1409.0473 - Introduced the attention mechanism with sequence-to-sequence models, significantly advancing neural machine translation and influencing various other sequence modeling tasks.
CS224n: Natural Language Processing with Deep Learning, Stanford University, 2023 (Stanford University) - A widely recognized university course offering comprehensive lectures and materials on deep learning methods for natural language processing, covering various sequence modeling tasks.