Jump to content

Connect SuperML | Leeroopedia MCP: Equip your AI agents with best practices, code verification, and debugging knowledge. Powered by Leeroo — building Organizational Superintelligence. Contact us at founders@leeroo.com.

Implementation:Open compass VLMEvalKit MMHelix Minesweeper Eval

From Leeroopedia
Field Value
source VLMEvalKit
domain Vision, Evaluation, Puzzle Solving, Minesweeper

Overview

Evaluates Minesweeper puzzle solutions in the MMHelix benchmark by comparing predicted mine coordinates with ground truth positions.

Description

The `MinesweeperEvaluator` class extends `BaseEvaluator` to validate Minesweeper solutions by comparing sets of mine coordinates. It supports both string and list input formats for coordinates. The `_extract_coordinates` method parses coordinate strings in formats like "(0,5),(0,7),(1,1)" into sets of (row, col) tuples. Evaluation is a strict set equality check between predicted and ground truth mine positions.

Usage

Called internally by the corresponding dataset class during evaluation.

Code Reference

  • Source: vlmeval/dataset/utils/mmhelix/evaluators/minesweeper_eval.py, Lines: L1-95
  • Import: from vlmeval.dataset.utils.mmhelix.evaluators.minesweeper_eval import MinesweeperEvaluator

Key Functions:

class MinesweeperEvaluator(BaseEvaluator):
    def evaluate(self, predicted_answer, ground_truth, initial_state) -> bool: ...
    def _extract_coordinates(self, coord_str) -> set: ...

I/O Contract

Direction Description
Inputs Predicted mine coordinates as string or list; ground truth coordinates as string or list
Outputs Boolean indicating whether predicted mine positions exactly match ground truth

Usage Examples

from vlmeval.dataset.utils.mmhelix.evaluators.minesweeper_eval import MinesweeperEvaluator

evaluator = MinesweeperEvaluator()
is_correct = evaluator.evaluate("(0,5),(1,2)", "(0,5),(1,2)", None)

Related Pages

Page Connections

Double-click a node to navigate. Hold to expand connections.
Principle
Implementation
Heuristic
Environment