Jump to content

Connect SuperML | Leeroopedia MCP: Equip your AI agents with best practices, code verification, and debugging knowledge. Powered by Leeroo — building Organizational Superintelligence. Contact us at founders@leeroo.com.

Principle:Infiniflow Ragflow Retrieval Configuration

From Leeroopedia
Knowledge Sources
Domains RAG, Information_Retrieval
Last Updated 2026-02-12 06:00 GMT

Overview

A configuration pattern that tunes hybrid retrieval parameters including similarity thresholds, vector/keyword weighting, and reranking.

Description

Retrieval Configuration controls how the RAG system searches for relevant chunks at chat time. Key parameters include similarity_threshold (minimum score to include), vector_similarity_weight (balance between vector and keyword search), top_n (chunks returned to LLM context), top_k (initial pool size), and optional rerank_id (cross-encoder reranking model).

Usage

Configure when tuning chat application retrieval quality. Lower thresholds increase recall but may include less relevant chunks; higher vector weights favor semantic matching over exact keyword matching.

Theoretical Basis

Retrieval quality is the primary determinant of RAG answer quality. The key trade-offs are:

  • Precision vs Recall: Similarity threshold controls the cutoff
  • Semantic vs Lexical: Vector weight balances dense and sparse retrieval
  • Reranking: Cross-encoder reranking improves precision at the cost of latency

Related Pages

Implemented By

Page Connections

Double-click a node to navigate. Hold to expand connections.
Principle
Implementation
Heuristic
Environment