Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks, Patrick Lewis, Ethan Perez, Aleksandra Piktus, Fabio Petroni, Vladimir Karpukhin, Naman Goyal, Heinrich Küttler, Mike Lewis, Wen-tau Yih, Tim Rocktäschel, Sebastian Riedel, Douwe Kiela, 2020Advances in Neural Information Processing Systems (NeurIPS)DOI: 10.48550/arXiv.2005.11401 - This paper introduces Retrieval Augmented Generation (RAG), explaining how combining an information retrieval system with a pre-trained language model can enhance performance on knowledge-intensive tasks.
What is a Vector Database?, Roie Schwaber-Cohen, 2023 (Pinecone) - Provides an introduction to vector databases, their components, and how they facilitate semantic search using vector embeddings and similarity metrics.
Neural Language Models and Word Embeddings, Dan Jurafsky and James H. Martin, 2023 (Pearson) - A chapter from a textbook on natural language processing that describes the application of word and sentence embeddings, including vector space models and similarity metrics.