Hybrid Search

Hybrid search is a technique that combines keyword-based full-text search (such as BM25) with vector search (semantic search), leveraging the strengths of both to improve retrieval accuracy.
The quality of a RAG system's answers depends heavily on whether the relevant documents are retrieved in the first place. Vector search alone struggles with exact matches for proper nouns and identifiers like "ISO 27001," while BM25 alone cannot handle semantic paraphrases like "international standard for information security." Hybrid search compensates for both weaknesses.
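The most common implementation pattern runs BM25 and vector search independently, then merges the two result lists using RRF (Reciprocal Rank Fusion). RRF scores each document by summing the reciprocal of its rank in each list, usually with a small smoothing constant added to the rank, and sorts documents by that sum. Because it uses only ranks, it needs no normalization between BM25 scores and vector similarities, which live on different scales. The formula is simple, yet it often outperforms either search method alone.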
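As a minimal sketch of the fusion step, assume each search method returns a list of document IDs ordered from best to worst. Each document then receives score(d) = sum over lists of 1 / (k + rank), where rank starts at 1. The function name rrf_fuse, the sample IDs, and the constant k = 60 (a commonly used default) are assumptions for illustration, not taken from any particular library.

```python
from collections import defaultdict


def rrf_fuse(rankings, k=60):
    """Fuse several ranked lists of document IDs with Reciprocal Rank Fusion.

    rankings: list of lists, each ordered best-first.
    k: smoothing constant (60 is a common default; an assumption here).
    Returns a list of (doc_id, score) sorted by descending fused score.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        # Ranks are 1-based: the top result contributes 1 / (k + 1).
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Hypothetical result lists from the two retrievers.
bm25_results = ["a", "b", "c"]
vector_results = ["b", "d", "a"]

fused = rrf_fuse([bm25_results, vector_results])
fused_ids = [doc_id for doc_id, _ in fused]
print(fused_ids)
```

Documents found by both methods ("a" and "b") rise above those found by only one, which is the behavior hybrid search relies on.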
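Combining pgvector with PostgreSQL's full-text search enables hybrid search without additional infrastructure. On Supabase, maintaining both a vector column and a tsvector column in the same table and computing both rankings within SQL is a practical approach. Note that PostgreSQL's built-in ranking functions (ts_rank and ts_rank_cd) are not BM25, so the keyword side is an approximation of BM25 unless a dedicated extension is added.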
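One way this could look is sketched below as a SQL query held in a Python string, ready to be passed to a driver such as psycopg. The table name documents and the columns id, embedding (pgvector) and fts (tsvector) are hypothetical, as are the helper names. The query uses pgvector's <=> cosine-distance operator and PostgreSQL's ts_rank_cd, then fuses the two rankings with RRF inside SQL. It has not been run against a live database here.

```python
# Hypothetical schema: documents(id, embedding vector, fts tsvector).
HYBRID_SQL = """
WITH semantic AS (
    SELECT id,
           ROW_NUMBER() OVER (ORDER BY embedding <=> %(query_vec)s::vector) AS rnk
    FROM documents
    ORDER BY embedding <=> %(query_vec)s::vector
    LIMIT %(candidates)s
),
keyword AS (
    SELECT id,
           ROW_NUMBER() OVER (ORDER BY ts_rank_cd(fts, query) DESC) AS rnk
    FROM documents, plainto_tsquery('english', %(query_text)s) AS query
    WHERE fts @@ query
    ORDER BY ts_rank_cd(fts, query) DESC
    LIMIT %(candidates)s
)
SELECT COALESCE(s.id, k.id) AS id,
       COALESCE(1.0 / (%(k)s + s.rnk), 0.0)
     + COALESCE(1.0 / (%(k)s + k.rnk), 0.0) AS rrf_score
FROM semantic s
FULL OUTER JOIN keyword k ON s.id = k.id
ORDER BY rrf_score DESC
LIMIT %(top_n)s;
"""


def to_pgvector_literal(vec):
    """Format a list of floats as a pgvector text literal such as '[0.1,0.25]'."""
    return "[" + ",".join(str(float(x)) for x in vec) + "]"


def build_params(query_text, query_vec, k=60, candidates=50, top_n=10):
    """Collect the named parameters the query above expects."""
    return {
        "query_text": query_text,
        "query_vec": to_pgvector_literal(query_vec),
        "k": k,
        "candidates": candidates,
        "top_n": top_n,
    }


params = build_params("ISO 27001", [0.1, 0.25])
# With a driver that supports %(name)s placeholders:
#     cursor.execute(HYBRID_SQL, params)
```

The FULL OUTER JOIN keeps documents that appear in only one of the two candidate lists; their missing rank contributes zero to the fused score.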
Chunk size design also affects accuracy. Smaller chunks tend to make vector matches more precise, but they leave BM25 with fewer terms and less surrounding context per chunk. In practice, chunks of 500-1000 tokens that overlap with adjacent segments are common.
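A minimal sketch of overlapping chunking follows. It assumes the text has already been tokenized into a list; whitespace splitting stands in for a real tokenizer, and the function name chunk_tokens and the sizes used are illustrative only.

```python
def chunk_tokens(tokens, chunk_size, overlap):
    """Split a token list into chunks of up to chunk_size tokens,
    where each chunk repeats the last `overlap` tokens of the previous one."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must be in [0, chunk_size)")

    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(tokens), step):
        chunks.append(tokens[start:start + chunk_size])
        # Stop once a chunk reaches the end, to avoid a trailing fragment
        # that contains only already-covered overlap tokens.
        if start + chunk_size >= len(tokens):
            break
    return chunks


# Tiny demonstration with integers standing in for tokens.
demo_chunks = chunk_tokens(list(range(10)), chunk_size=4, overlap=1)
print(demo_chunks)

# For real text, a whitespace split is a crude stand-in for a tokenizer:
words = "hybrid search combines keyword search with vector search".split()
word_chunks = chunk_tokens(words, chunk_size=5, overlap=2)
```

For the 500-1000 token range mentioned above, the same function would be called with, for example, chunk_size=800 and an overlap of around 100, with both values tuned against retrieval quality on real queries.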