Principle:Confident ai Deepeval Retriever Span Enrichment

Overview

Retriever Span Enrichment is the principle of enriching retriever-type spans with retrieval-specific metadata. By recording parameters such as the embedding model, top-k value, and chunk size for each retrieval operation, teams gain visibility into the configuration and behavior of their retrieval pipelines, enabling systematic analysis of retrieval quality.

Core Concept

Retrieval-augmented generation (RAG) applications depend heavily on the quality of their retrieval step. The effectiveness of retrieval is governed by several configuration parameters that directly impact the relevance and completeness of retrieved context. Enriching retriever spans with this data is essential because:

Embedding model tracking -- Recording which embedding model (embedder) was used for each retrieval operation allows teams to compare retrieval quality across different embedding models and track the impact of model changes.
Top-k analysis -- The top_k parameter controls how many documents are retrieved. Tracking this value per span enables correlation between retrieval breadth and downstream answer quality.
Chunk size monitoring -- The chunk_size parameter determines the granularity of retrieved text segments. Different chunk sizes affect both retrieval precision and the amount of context provided to the LLM.

Theoretical Basis

This principle is grounded in information retrieval and embedding-based search practices:

Information retrieval monitoring -- The practice of instrumenting retrieval systems to track the parameters that govern search quality, enabling data-driven tuning of retrieval configurations.
Embedding model tracking -- Treating the choice of embedding model as a first-class configuration parameter whose impact on retrieval quality should be measured and compared.

Why It Matters

Without retriever span enrichment:

Retrieval quality issues are opaque -- teams cannot determine whether poor answers stem from the retrieval step or the generation step
Configuration drift goes undetected -- changes to top-k or chunk size parameters are not recorded alongside their impact on output quality
Embedding model comparisons lack data -- no structured way to compare the effectiveness of different embedding models in production
RAG optimization is ad hoc -- without per-retrieval metadata, tuning retrieval parameters relies on guesswork rather than data

Retriever span enrichment provides the data foundation for systematic RAG pipeline optimization.

Relationship to Implementation

This principle is realized through the update_retriever_span function, which injects retrieval-specific metadata into the current retriever-type span.

Implementation:Confident_ai_Deepeval_Update_Retriever_Span

Metadata

DeepEval Tracing Observability LLM_Evaluation 2026-02-14 09:00 GMT

Page Connections

Double-click a node to navigate. Hold to expand connections.

Principle

Implementation

Heuristic

Environment