LangChain Document Loaders, LangChain, 2024 (LangChain) - Official documentation detailing the various DocumentLoader classes and their use for data ingestion in LangChain.
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks, Patrick Lewis, Ethan Perez, Aleksandra Piktus, Fabio Petroni, Vladimir Karpukhin, Naman Goyal, Heinrich Küttler, Mike Lewis, Wen-tau Yih, Tim Rocktäschel, Sebastian Riedel, Douwe Kiela, 2020Advances in Neural Information Processing Systems (NeurIPS), Vol. 33DOI: 10.48550/arXiv.2005.11401 - A foundational paper introducing the Retrieval-Augmented Generation (RAG) paradigm, offering the high-level design for systems that require data ingestion.
Document Object API Reference, LangChain, 2024 - Official API reference for LangChain's fundamental Document object, detailing its attributes like page_content and metadata.