RRF (Reciprocal Rank Fusion) is a scoring method that integrates ranking results returned by multiple retrieval methods. By summing the reciprocal ranks from each method, it enables the fusion of different scoring systems without normalization.
In RAG pipelines, a hybrid configuration combining keyword search via BM25 and vector search (semantic search) is common. However, since BM25 scores and cosine similarity differ entirely in scale and distribution, simply adding them together does not produce a meaningful integration.
RRF solves this problem simply. It uses only the "rank" of documents returned by each search method, summing the 1 / (k + rank) score across all methods. The constant k (typically 60) is a parameter that adjusts rank weighting — the larger the value, the smaller the gap between higher and lower ranks.
If a document appears at rank 3 in BM25 and rank 7 in vector search:
RRF score = 1/(60+3) + 1/(60+7) = 0.0159 + 0.0149 = 0.0308
Documents that rank reasonably well in both searches tend to score higher than documents that only appear in one. This contributes to the stability of hybrid search.
Since RRF does not use the absolute values of original scores, it has the advantage of requiring no score calibration between search engines; however, care must be taken when many documents share the same rank or when handling long-tail documents. Major vector databases including Elasticsearch 8.x and later, Weaviate, and Qdrant natively support RRF.


RLHF is a reinforcement learning method that uses human feedback as a reward, while RLVR is a reinforcement learning method that uses verifiable correct answers as a reward; both are used to align LLM outputs with human expectations.

RAG (Retrieval-Augmented Generation) is a technique that improves the accuracy and currency of responses by retrieving relevant information from external knowledge sources and appending the results to the input of an LLM.

Agentic RAG is an architecture in which an LLM autonomously and iteratively generates search queries, evaluates results, and decides whether to re-retrieve information as an agent, achieving answer accuracy that cannot be obtained with simple single-turn RAG.


What is a Vector Database? A Complete Guide to How It Works, Top Product Comparisons, and RAG Applications