Speech and Language Processing (3rd ed. draft), Daniel Jurafsky and James H. Martin, 2025 - A comprehensive textbook offering foundational knowledge on language models, including the definition, calculation, and detailed interpretation of perplexity.
CrossEntropyLoss, PyTorch Developers, 2023 - Official documentation for the PyTorch CrossEntropyLoss function, clarifying its relationship to negative log-likelihood and its use in perplexity calculation.
A Survey of Large Language Models, Wayne Xin Zhao, Kun Zhou, Junyi Li, Tianyi Tang, Xiaolei Wang, Yupeng Hou, Yingqian Min, Beichen Zhang, Junjie Zhang, Zican Dong, Yifan Du, Chen Yang, Yushuo Chen, Zhipeng Chen, Jinhao Jiang, Ruiyang Ren, Yifan Li, Xinyu Tang, Zikang Liu, Peiyu Liu, Jian-Yun Nie, Ji-Rong Wen, 2023arXiv preprint arXiv:2303.18223DOI: 10.48550/arXiv.2303.18223 - A comprehensive survey of large language models, including discussions on various evaluation metrics like perplexity and their utility and limitations in contemporary research.