Speech and Language Processing, Daniel Jurafsky and James H. Martin, 2025 (Prentice Hall) - A comprehensive textbook covering statistical language modeling and perplexity as a core evaluation metric.
A Neural Probabilistic Language Model, Yoshua Bengio, Réjean Ducharme, Pascal Vincent, and Christian Jauvin, 2003Journal of Machine Learning Research, Vol. 3DOI: 10.1162/jmlr.2003.3.nov.1137 - A seminal paper introducing neural probabilistic language models, which uses perplexity as a primary evaluation metric.
Perplexity calculation for language models, Hugging Face, 2024 - Provides practical guidance and examples for calculating perplexity using the Hugging Face Transformers library, relevant for modern LLM evaluation.