llama.cpp repository, Georgi Gerganov and the llama.cpp Community Contributors, 2024 - The project's repository and associated documentation, detailing the GGUF file format and its specific quantization schemes (e.g., Q_K variants) used for local LLM inference.