Deep Learning, Ian Goodfellow, Yoshua Bengio, and Aaron Courville, 2016 (MIT Press) - Offers a fundamental understanding of deep learning model evaluation, discussing test sets and metrics.
Learn the Basics: Training and evaluation loop, PyTorch Developers, 2024 (PyTorch Foundation) - Official documentation describing model.eval() and torch.no_grad(), practices for accurate PyTorch model evaluation.