Hybrid Search at Scale: Combining Dense and Sparse Retrievers
Was this section helpful?
Okapi at TREC-3, S. E. Robertson, S. Walker, S. Jones, M. M. Hancock-Beaulieu, M. Gatford, 1995Proceedings of the Third Text REtrieval Conference (TREC 3) (NIST) - A foundational paper that introduces the Okapi BM25 ranking function, a cornerstone algorithm for sparse lexical retrieval systems.
Reciprocal Rank Fusion: A Unified Rank Fusion Method, Gordon V. Cormack, Charles L. A. Clarke, and Stefan Buettcher, 2009Proceedings of the 32nd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR) (ACM)DOI: 10.1145/1571941.1572023 - Presents Reciprocal Rank Fusion (RRF) as an effective and robust method for combining ranked lists from different retrieval systems without requiring score normalization.
SPLADE: Sparse Lexical and Aspect-based Document Embeddings, Thibault Formal, Benjamin Piwowarski, Stéphane Clinchant, 2021Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval (ACM (Association for Computing Machinery))DOI: 10.1145/3404835.3463098 - Introduces SPLADE, a method that learns sparse vector representations suitable for efficient indexing with traditional inverted indexes, offering a semantically enhanced sparse retrieval approach.