Implementation:Open compass VLMEvalKit GSM8KVDataset

From Leeroopedia
Revision as of 13:29, 16 February 2026 by Admin (talk | contribs) (Auto-imported from implementations/Open_compass_VLMEvalKit_GSM8KVDataset.md)
Field | Value
source | VLMEvalKit
domain | Vision, Benchmarking, Mathematical Reasoning, Visual

Overview

Benchmark dataset implementation for GSM8K-V evaluation in VLMEvalKit.

Description

GSM8KVDataset inherits from ImageBaseDataset and implements the GSM8K-V benchmark for visual mathematical reasoning. Its TYPE class attribute is set to 'VQA'. It supports four evaluation modes (text_only, visual_explicit, visual_implicit, and all), selected via an environment variable.
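The mode selection described above can be sketched as follows. This is a minimal illustration, not the code in gsm8k_v.py: the variable name GSM8KV_MODE, the default of all, and the reading of all as "run each of the other three modes" are assumptions made for the example, since this page does not name the variable or define that behaviour.

```python
import os

# Mode names as listed in the description above.
VALID_MODES = ("text_only", "visual_explicit", "visual_implicit", "all")


def resolve_mode(env_var="GSM8KV_MODE", default="all"):
    """Read the evaluation mode from the environment.

    `GSM8KV_MODE` is a hypothetical variable name used only for
    illustration; check gsm8k_v.py for the real one.
    """
    mode = os.environ.get(env_var, default).strip().lower()
    if mode not in VALID_MODES:
        raise ValueError(f"Unknown mode {mode!r}; expected one of {VALID_MODES}")
    return mode


def expand_mode(mode):
    """Assumption: 'all' means every concrete mode is evaluated in turn."""
    if mode == "all":
        return [m for m in VALID_MODES if m != "all"]
    return [mode]
```

Validating the value up front makes a misspelled mode fail immediately rather than silently falling back to a default.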

Usage

The class is registered in vlmeval/dataset/__init__.py and instantiated through build_dataset() using the benchmark name.
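A name-based lookup of this kind can be sketched with a small registry. The classes and the build_dataset_sketch function below are hypothetical stand-ins written for this page; they assume each dataset class advertises its names through the keys of DATASET_URL, which may differ from how VLMEvalKit resolves names internally.

```python
class BaseDatasetSketch:
    """Hypothetical stand-in for ImageBaseDataset."""

    DATASET_URL = {}

    def __init__(self, dataset):
        self.dataset_name = dataset

    @classmethod
    def supported_datasets(cls):
        # Assumption: the benchmark names a class handles are the keys of DATASET_URL.
        return list(cls.DATASET_URL)


class GSM8KVSketch(BaseDatasetSketch):
    """Hypothetical stand-in for GSM8KVDataset."""

    TYPE = "VQA"
    DATASET_URL = {"GSM8K-V": "<placeholder, not a real URL>"}


# Registering a class means adding it to the list that the builder scans.
DATASET_CLASSES = [GSM8KVSketch]


def build_dataset_sketch(name):
    """Return an instance of the first class that supports `name`, else None."""
    for cls in DATASET_CLASSES:
        if name in cls.supported_datasets():
            return cls(dataset=name)
    return None
```

With this layout, adding a benchmark only requires defining a class and appending it to the registry; callers keep using the benchmark name.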

Code Reference

  • Source: vlmeval/dataset/gsm8k_v.py, Lines: L1-177
  • Import: from vlmeval.dataset.gsm8k_v import GSM8KVDataset

Signature:

class GSM8KVDataset(ImageBaseDataset):
    TYPE = 'VQA'
    DATASET_URL = {...}
    DATASET_MD5 = {...}
    ...

I/O Contract

Direction | Description
Inputs | TSV dataset file with image/video paths and questions
Outputs | Evaluation results DataFrame with scores per category
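The output side of this contract, scores per category, can be illustrated with a small aggregation sketch. The record keys (category, prediction, answer) and the last-number matching rule are assumptions chosen for the example; the evaluation in gsm8k_v.py may extract and compare answers differently, and it returns a DataFrame rather than a plain dict.

```python
import re
from collections import defaultdict


def extract_number(text):
    """Return the last number mentioned in a free-form answer, or None."""
    matches = re.findall(r"-?\d+(?:\.\d+)?", text.replace(",", ""))
    return float(matches[-1]) if matches else None


def accuracy_by_category(records):
    """Compute the fraction of numerically correct predictions per category.

    Each record is a dict with hypothetical keys:
    'category', 'prediction' (model output text), and 'answer' (gold number).
    """
    hits = defaultdict(int)
    totals = defaultdict(int)
    for rec in records:
        pred = extract_number(rec["prediction"])
        gold = float(rec["answer"])
        totals[rec["category"]] += 1
        if pred is not None and abs(pred - gold) < 1e-6:
            hits[rec["category"]] += 1
    return {cat: hits[cat] / totals[cat] for cat in totals}
```

Each category's score is simply correct predictions divided by total questions in that category, so categories of different sizes are reported independently.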

Usage Examples

from vlmeval.dataset import build_dataset

# Look up the benchmark by its registered name and get the dataset instance.
dataset = build_dataset('GSM8K-V')
