Computer Architecture: A Quantitative Approach, John L. Hennessy and David A. Patterson, 2017 (Morgan Kaufmann) - This textbook provides a comprehensive treatment of CPU design, including instruction sets, cores, and parallel processing, which are fundamental to understanding CPU performance for LLMs.
Dive into Deep Learning, Aston Zhang, Zachary C. Lipton, Mu Li, and Alexander J. Smola, 2024 (Cambridge University Press) - This online book's introductory chapters discuss the computational demands of deep learning and the roles of CPUs and GPUs in managing these workloads, providing context for LLM hardware.
Intel® Software Developer Manuals, Intel Corporation, 2025 (Intel Corporation) - This official collection of manuals provides detailed information on Intel CPU architectures, including instruction set extensions like AVX2, and guidelines for optimizing software for better performance, directly relevant to LLM CPU inference.