Version Control and Experiment Tracking for RAG Components
Data Version Control (DVC) Documentation, Iterative, 2023 - The official guide for DVC, a tool designed for data versioning, pipeline management, and experiment tracking, fundamental for managing data artifacts like knowledge bases and processed data in RAG systems.
Hidden Technical Debt in Machine Learning Systems, D. Sculley, Gary Holt, Daniel Golovin, Eugene Davydov, Todd Phillips, Dietmar Ebner, Vinay Chaudhary, Michael Young, Jean-François Crespo, Dan Dennison, 2015, Advances in Neural Information Processing Systems, Vol. 28 (Curran Associates, Inc.) - A foundational paper that highlights the challenges and hidden costs of deploying and maintaining machine learning systems in production, emphasizing the necessity of robust engineering practices like version control and experiment tracking.