Connectionist Temporal Classification: Labelling Sequences without Alignment, Alex Graves, Santiago Fernandez, Faustino Gomez, and Jürgen Schmidhuber, 2006Proceedings of the 23rd International Conference on Machine Learning - ICML '06 (ACM)DOI: 10.1145/1143812.1143862 - The original foundational paper introducing the Connectionist Temporal Classification (CTC) algorithm, including its mathematical formulation, the concept of blank labels, the dynamic programming approach for loss computation, and basic decoding strategies.
Deep Speech: Scaling up end-to-end speech recognition, Awni Hannun, Carl Case, Jared Casper, Bryan Catanzaro, Greg Diamos, Erich Elsen, Ryan Prenger, Sanjeev Satheesh, Shubho Sengupta, Adam Coates, Andrew Y. Ng, 2014arXiv preprint arXiv:1412.5567DOI: 10.48550/arXiv.1412.5567 - This paper presents a practical application of CTC in a large-scale end-to-end speech recognition system, providing details on its implementation, training, and effective decoding strategies including beam search with language model integration.
Speech and Language Processing (3rd ed. draft), Daniel Jurafsky and James H. Martin, 2025 (Online Draft) - An established textbook offering a pedagogical and comprehensive explanation of Connectionist Temporal Classification within the broader context of automatic speech recognition, suitable for advanced students seeking a structured overview.
Supervised Sequence Labelling with Recurrent Neural Networks, Alex Graves, 2012 Vol. 385 (Springer)DOI: 10.1007/978-3-642-24797-2 - Based on the author's Ph.D. thesis, this book provides an in-depth theoretical explanation of Connectionist Temporal Classification, its underlying principles, and its connections to other sequence labeling techniques using recurrent neural networks.