CUDA C++ Programming Guide, NVIDIA Corporation, 2023 (NVIDIA Corporation) - Provides details on the CUDA architecture and parallel computing.
Deep Learning, Ian Goodfellow, Yoshua Bengio, and Aaron Courville, 2016 (MIT Press) - A textbook describing deep learning fundamentals, including computation aspects.
llama.cpp, Georgi Gerganov and the llama.cpp contributors, 2023 - The project repository for a C/C++ implementation of LLM inference, supporting local execution and GGUF models.
Metal Performance Shaders Framework, Apple Developer Documentation, 2024 - Documentation for Apple's framework to accelerate machine learning tasks on Apple Silicon using Metal.