Identifying Performance Bottlenecks in RAG Pipelines
Was this section helpful?
Retrieval-Augmented Generation for Large Language Models: A Survey, Yunfan Gao, Yun Xiong, Xinyu Gao, Kangxiang Jia, Jinliu Pan, Yuxi Bi, Yi Dai, Jiawei Sun, Meng Wang, Haofen Wang, 2023arXiv preprint arXiv:2312.10997DOI: 10.48550/arXiv.2312.10997 - A comprehensive and recent survey covering the state-of-the-art in RAG, including an overview of the architecture, key components, and challenges in deploying RAG systems, such as performance and efficiency.
OpenTelemetry Documentation, The OpenTelemetry Authors, 2025 - Official documentation for OpenTelemetry, a vendor-neutral observability framework for collecting telemetry data (traces, metrics, logs) crucial for identifying performance bottlenecks in distributed RAG systems.