Hybrid Searchとは？

Hybrid Search

Updated:March 19, 2026Published:March 19, 2026

Hybrid search is a technique that combines keyword-based full-text search (such as BM25) with vector search (semantic search), leveraging the strengths of both to improve retrieval accuracy.

The single biggest factor in RAG retrieval accuracy is whether relevant documents are correctly retrieved. Vector search alone struggles with exact matches for proper nouns like "ISO 27001," while BM25 alone cannot handle semantic paraphrases like "international standard for information security." Hybrid search compensates for these two weaknesses.

The most common implementation pattern runs BM25 and vector search independently, then merges results using RRF (Reciprocal Rank Fusion). It sums the reciprocals of each method's rankings to produce a final reranked score. The formula is simple, yet it consistently outperforms either search method alone.

Combining pgvector with PostgreSQL's full-text search enables hybrid search without additional infrastructure. On Supabase, maintaining both a vector column and a tsvector column in the same table and computing both scores within SQL is a practical approach.

Chunk size design also affects accuracy. Smaller chunks improve vector search precision but lose context for BM25. In practice, chunks of 500-1000 tokens with overlapping context from adjacent segments are common.

Hybrid Search

Related Terms

AI ROI (Return on Investment in AI)

AI Observability

Ambient AI

BPO (Business Process Outsourcing)