Jump to content

Connect SuperML | Leeroopedia MCP: Equip your AI agents with best practices, code verification, and debugging knowledge. Powered by Leeroo — building Organizational Superintelligence. Contact us at founders@leeroo.com.

Implementation:Open compass VLMEvalKit SArena Metrics

From Leeroopedia
Revision as of 13:32, 16 February 2026 by Admin (talk | contribs) (Auto-imported from implementations/Open_compass_VLMEvalKit_SArena_Metrics.md)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Field Value
source VLMEvalKit
domain Vision, Evaluation, Image Generation, Metrics Orchestration

Overview

Orchestrates multiple image generation evaluation metrics for the SArena (InternSVG) benchmark through a configurable metrics registry.

Description

The `InternSVGMetrics` class uses a `MetricsConfig` dataclass to selectively instantiate and manage evaluation metrics including FID, FID-C, CLIP Score (T2I/I2I), DINO Score, LPIPS, SSIM, PSNR, and Token Length. The `calculate_metrics` method iterates over active metrics, computing scores for each and aggregating results. The registry pattern uses lazy initialization via lambda builders for memory-efficient metric loading.

Usage

Called internally by the corresponding dataset class during evaluation.

Code Reference

  • Source: vlmeval/dataset/utils/SArena/metrics.py, Lines: L1-82
  • Import: from vlmeval.dataset.utils.SArena.metrics import InternSVGMetrics, MetricsConfig

Key Functions:

@dataclass
class MetricsConfig: ...

class InternSVGMetrics:
    def calculate_metrics(self, batch): ...
    def reset(self): ...

I/O Contract

Direction Description
Inputs A batch dict containing image data ('pred_im', 'gt_im', 'caption' as applicable) and a MetricsConfig specifying which metrics to use
Outputs Dictionary mapping metric names to their computed average scores

Usage Examples

from vlmeval.dataset.utils.SArena.metrics import InternSVGMetrics, MetricsConfig

config = MetricsConfig(use_FID=True, use_CLIP_Score_T2I=True)
metrics = InternSVGMetrics(config, tokenizer_path="path/to/tokenizer")
results = metrics.calculate_metrics(batch)

Related Pages

Page Connections

Double-click a node to navigate. Hold to expand connections.
Principle
Implementation
Heuristic
Environment