Apache Kafka Documentation, The Apache Kafka Community, 2024 - Official documentation for Apache Kafka, covering its architecture, message broker functionality, stream processing with Kafka Streams, and integration patterns for real-time data pipelines.
Apache Flink Documentation, The Apache Flink Community, 2025 - Official documentation for Apache Flink, providing details on its capabilities as a distributed stream processing engine, state management, and event-time processing, suitable for building dependable RAG update handlers.
Designing Data-Intensive Applications: The Big Ideas Behind Reliable, Scalable, and Maintainable Systems, Martin Kleppmann, 2017 (O'Reilly Media) - A foundational book on the principles of building data systems, covering data models, replication, consistency, distributed transactions, stream processing, and challenges such as idempotency and schema evolution, all directly applicable to designing dependable CDC pipelines.