LightRAG/docs/rerank_integration.md
2025-07-08 11:16:34 +08:00

7.2 KiB

Rerank Integration in LightRAG

This document explains how to configure and use the rerank functionality in LightRAG to improve retrieval quality.

Overview

Reranking is an optional feature that improves the quality of retrieved documents by re-ordering them based on their relevance to the query. This is particularly useful when you want higher precision in document retrieval across all query modes (naive, local, global, hybrid, mix).

Architecture

The rerank integration follows a simplified design pattern:

  • Single Function Configuration: All rerank settings (model, API keys, top_k, etc.) are contained within the rerank function
  • Async Processing: Non-blocking rerank operations
  • Error Handling: Graceful fallback to original results
  • Optional Feature: Can be enabled/disabled via configuration
  • Code Reuse: Single generic implementation for Jina/Cohere compatible APIs

Configuration

Environment Variables

Set this variable in your .env file or environment:

# Enable/disable reranking
ENABLE_RERANK=True

Programmatic Configuration

from lightrag import LightRAG
from lightrag.rerank import custom_rerank, RerankModel

# Method 1: Using a custom rerank function with all settings included
async def my_rerank_func(query: str, documents: list, top_k: int = None, **kwargs):
    return await custom_rerank(
        query=query,
        documents=documents,
        model="BAAI/bge-reranker-v2-m3",
        base_url="https://api.your-provider.com/v1/rerank",
        api_key="your_api_key_here",
        top_k=top_k or 10,  # Handle top_k within the function
        **kwargs
    )

rag = LightRAG(
    working_dir="./rag_storage",
    llm_model_func=your_llm_func,
    embedding_func=your_embedding_func,
    enable_rerank=True,
    rerank_model_func=my_rerank_func,
)

# Method 2: Using RerankModel wrapper
rerank_model = RerankModel(
    rerank_func=custom_rerank,
    kwargs={
        "model": "BAAI/bge-reranker-v2-m3",
        "base_url": "https://api.your-provider.com/v1/rerank",
        "api_key": "your_api_key_here",
    }
)

rag = LightRAG(
    working_dir="./rag_storage",
    llm_model_func=your_llm_func,
    embedding_func=your_embedding_func,
    enable_rerank=True,
    rerank_model_func=rerank_model.rerank,
)

Supported Providers

For Jina/Cohere compatible APIs:

from lightrag.rerank import custom_rerank

# Your custom API endpoint
result = await custom_rerank(
    query="your query",
    documents=documents,
    model="BAAI/bge-reranker-v2-m3",
    base_url="https://api.your-provider.com/v1/rerank",
    api_key="your_api_key_here",
    top_k=10
)

2. Jina AI

from lightrag.rerank import jina_rerank

result = await jina_rerank(
    query="your query",
    documents=documents,
    model="BAAI/bge-reranker-v2-m3",
    api_key="your_jina_api_key",
    top_k=10
)

3. Cohere

from lightrag.rerank import cohere_rerank

result = await cohere_rerank(
    query="your query",
    documents=documents,
    model="rerank-english-v2.0",
    api_key="your_cohere_api_key",
    top_k=10
)

Integration Points

Reranking is automatically applied at these key retrieval stages:

  1. Naive Mode: After vector similarity search in _get_vector_context
  2. Local Mode: After entity retrieval in _get_node_data
  3. Global Mode: After relationship retrieval in _get_edge_data
  4. Hybrid/Mix Modes: Applied to all relevant components

Configuration Parameters

Parameter Type Default Description
enable_rerank bool False Enable/disable reranking
rerank_model_func callable None Custom rerank function containing all configurations (model, API keys, top_k, etc.)

Example Usage

Basic Usage

import asyncio
from lightrag import LightRAG, QueryParam
from lightrag.llm.openai import gpt_4o_mini_complete, openai_embedding
from lightrag.rerank import jina_rerank

async def my_rerank_func(query: str, documents: list, top_k: int = None, **kwargs):
    """Custom rerank function with all settings included"""
    return await jina_rerank(
        query=query,
        documents=documents,
        model="BAAI/bge-reranker-v2-m3",
        api_key="your_jina_api_key_here",
        top_k=top_k or 10,  # Default top_k if not provided
        **kwargs
    )

async def main():
    # Initialize with rerank enabled
    rag = LightRAG(
        working_dir="./rag_storage",
        llm_model_func=gpt_4o_mini_complete,
        embedding_func=openai_embedding,
        enable_rerank=True,
        rerank_model_func=my_rerank_func,
    )

    # Insert documents
    await rag.ainsert([
        "Document 1 content...",
        "Document 2 content...",
    ])

    # Query with rerank (automatically applied)
    result = await rag.aquery(
        "Your question here",
        param=QueryParam(mode="hybrid", top_k=5)  # This top_k is passed to rerank function
    )

    print(result)

asyncio.run(main())

Direct Rerank Usage

from lightrag.rerank import custom_rerank

async def test_rerank():
    documents = [
        {"content": "Text about topic A"},
        {"content": "Text about topic B"},
        {"content": "Text about topic C"},
    ]

    reranked = await custom_rerank(
        query="Tell me about topic A",
        documents=documents,
        model="BAAI/bge-reranker-v2-m3",
        base_url="https://api.your-provider.com/v1/rerank",
        api_key="your_api_key_here",
        top_k=2
    )

    for doc in reranked:
        print(f"Score: {doc.get('rerank_score')}, Content: {doc.get('content')}")

Best Practices

  1. Self-Contained Functions: Include all necessary configurations (API keys, models, top_k handling) within your rerank function
  2. Performance: Use reranking selectively for better performance vs. quality tradeoff
  3. API Limits: Monitor API usage and implement rate limiting within your rerank function
  4. Fallback: Always handle rerank failures gracefully (returns original results)
  5. Top-k Handling: Handle top_k parameter appropriately within your rerank function
  6. Cost Management: Consider rerank API costs in your budget planning

Troubleshooting

Common Issues

  1. API Key Missing: Ensure API keys are properly configured within your rerank function
  2. Network Issues: Check API endpoints and network connectivity
  3. Model Errors: Verify the rerank model name is supported by your API
  4. Document Format: Ensure documents have content or text fields

Debug Mode

Enable debug logging to see rerank operations:

import logging
logging.getLogger("lightrag.rerank").setLevel(logging.DEBUG)

Error Handling

The rerank integration includes automatic fallback:

# If rerank fails, original documents are returned
# No exceptions are raised to the user
# Errors are logged for debugging

API Compatibility

The generic rerank API expects this response format:

{
  "results": [
    {
      "index": 0,
      "relevance_score": 0.95
    },
    {
      "index": 2,
      "relevance_score": 0.87
    }
  ]
}

This is compatible with:

  • Jina AI Rerank API
  • Cohere Rerank API
  • Custom APIs following the same format