Dense Passage Retrieval for Open-Domain Question Answering, Vladimir Karpukhin, Barlas Oğuz, Sewon Min, Patrick Lewis, Ledell Wu, Sergey Edunov, Danqi Chen, Wen-tau Yih, 2020EMNLP 2020DOI: 10.48550/arXiv.2004.04906 - Presents Dense Passage Retrieval (DPR), a method that uses dense vector representations (embeddings) to efficiently retrieve relevant passages for question answering, directly applying semantic search.
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks, Patrick Lewis, Ethan Perez, Aleksandra Piktus, Fabio Petroni, Vladimir Karpukhin, Naman Goyal, Heinrich Küttler, Mike Lewis, Wen-tau Yih, Tim Rocktäschel, Sebastian Riedel, Douwe Kiela, 2020Advances in Neural Information Processing Systems (NeurIPS 2020)DOI: 10.48550/arXiv.2005.11401 - The foundational paper introducing Retrieval-Augmented Generation (RAG), explaining how combining information retrieval with generation enhances large language models.