Python toolkit for building production-ready LLM applications. Modular utilities for prompts, RAG, agents, structured outputs, and multi-provider support.
Was this section helpful?
Quantization API (torch.ao.quantization), PyTorch Documentation, 2024 - Official documentation for PyTorch's quantization API, including QAT implementation details.