Jump to content

Connect SuperML | Leeroopedia MCP: Equip your AI agents with best practices, code verification, and debugging knowledge. Powered by Leeroo — building Organizational Superintelligence. Contact us at founders@leeroo.com.

Implementation:Open compass VLMEvalKit MMHelix Minesweeper Eval

From Leeroopedia
Revision as of 13:30, 16 February 2026 by Admin (talk | contribs) (Auto-imported from implementations/Open_compass_VLMEvalKit_MMHelix_Minesweeper_Eval.md)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Field Value
source VLMEvalKit
domain Vision, Evaluation, Puzzle Solving, Minesweeper

Overview

Evaluates Minesweeper puzzle solutions in the MMHelix benchmark by comparing predicted mine coordinates with ground truth positions.

Description

The `MinesweeperEvaluator` class extends `BaseEvaluator` to validate Minesweeper solutions by comparing sets of mine coordinates. It supports both string and list input formats for coordinates. The `_extract_coordinates` method parses coordinate strings in formats like "(0,5),(0,7),(1,1)" into sets of (row, col) tuples. Evaluation is a strict set equality check between predicted and ground truth mine positions.

Usage

Called internally by the corresponding dataset class during evaluation.

Code Reference

  • Source: vlmeval/dataset/utils/mmhelix/evaluators/minesweeper_eval.py, Lines: L1-95
  • Import: from vlmeval.dataset.utils.mmhelix.evaluators.minesweeper_eval import MinesweeperEvaluator

Key Functions:

class MinesweeperEvaluator(BaseEvaluator):
    def evaluate(self, predicted_answer, ground_truth, initial_state) -> bool: ...
    def _extract_coordinates(self, coord_str) -> set: ...

I/O Contract

Direction Description
Inputs Predicted mine coordinates as string or list; ground truth coordinates as string or list
Outputs Boolean indicating whether predicted mine positions exactly match ground truth

Usage Examples

from vlmeval.dataset.utils.mmhelix.evaluators.minesweeper_eval import MinesweeperEvaluator

evaluator = MinesweeperEvaluator()
is_correct = evaluator.evaluate("(0,5),(1,2)", "(0,5),(1,2)", None)

Related Pages

Page Connections

Double-click a node to navigate. Hold to expand connections.
Principle
Implementation
Heuristic
Environment