Jump to content

Connect SuperML | Leeroopedia MCP: Equip your AI agents with best practices, code verification, and debugging knowledge. Powered by Leeroo — building Organizational Superintelligence. Contact us at founders@leeroo.com.

Implementation:Open compass VLMEvalKit MathV Utils

From Leeroopedia
Field Value
source VLMEvalKit
domain Vision, Evaluation, Mathematics, Symbolic Comparison

Overview

Provides mathematical answer equivalence checking and GPT-based answer extraction for the MathV (Math Vision) benchmark.

Description

This module implements `is_equal` with a 30-second timeout decorator for comparing answers via direct string matching, numerical evaluation, and LaTeX symbolic parsing using `latex2sympy2`. The `get_gpt4_ICE` function provides in-context learning examples for GPT-4 based answer extraction from model responses. It uses tolerance-based comparison (1e-6) for numerical equivalence.

Usage

Called internally by the corresponding dataset class during evaluation.

Code Reference

  • Source: vlmeval/dataset/utils/mathv.py, Lines: L1-180
  • Import: from vlmeval.dataset.utils.mathv import is_equal, get_gpt4_ICE

Key Functions:

@timeout_decorator.timeout(30)
def is_equal(asw: str, gt_asw: str) -> bool: ...
def get_gpt4_ICE(): ...

I/O Contract

Direction Description
Inputs Two answer strings (predicted and ground truth) for mathematical equivalence checking
Outputs Boolean indicating whether the answers are equivalent

Usage Examples

from vlmeval.dataset.utils.mathv import is_equal

result = is_equal("0.5", "\\frac{1}{2}")
# result = True

Related Pages

Page Connections

Double-click a node to navigate. Hold to expand connections.
Principle
Implementation
Heuristic
Environment