Jump to content

Connect SuperML | Leeroopedia MCP: Equip your AI agents with best practices, code verification, and debugging knowledge. Powered by Leeroo — building Organizational Superintelligence. Contact us at founders@leeroo.com.

Implementation:Open compass VLMEvalKit MMReason Utils

From Leeroopedia
Field Value
source VLMEvalKit
domain Vision, Benchmarking, Mathematical Reasoning, Utilities

Overview

Utility module for MMReason evaluation in VLMEvalKit.

Description

This utility file does not define a dataset class. It provides evaluation helper functions for the MMReason mathematical reasoning benchmark including GPT-4 based answer extraction, score comparison, and per-discipline accuracy computation. Key functions include build_MMReason_gpt4_extract_prompt, build_MMReason_gpt4_score_prompt, and MMReason_acc.

Usage

Imported by related dataset modules in the VLMEvalKit evaluation pipeline.

Code Reference

  • Source: vlmeval/dataset/mmreason.py, Lines: L1-249
  • Import: from vlmeval.dataset.mmreason import *

I/O Contract

Direction Description
Inputs Evaluation data, model predictions, ground-truth annotations
Outputs Processed evaluation results and metric scores

Related Pages

Page Connections

Double-click a node to navigate. Hold to expand connections.
Principle
Implementation
Heuristic
Environment