Implementation:Open compass VLMEvalKit SlideVQA
Appearance
| Field | Value |
|---|---|
| source | VLMEvalKit |
| domain | Vision, Benchmarking, Slide Document Understanding |
Overview
Benchmark dataset implementation for SlideVQA evaluation in VLMEvalKit.
Description
SlideVQA inherits from ImageBaseDataset and implements the SlideVQA benchmark for slide document understanding. The TYPE field is set to 'VQA'. It supports full and mini splits with per-model page handling configurations similar to MMLongBench.
Usage
Registered in vlmeval/dataset/__init__.py and invoked through build_dataset() by benchmark name.
Code Reference
- Source:
vlmeval/dataset/slidevqa.py, Lines: L1-189 - Import:
from vlmeval.dataset.slidevqa import SlideVQA
Signature:
class SlideVQA(ImageBaseDataset):
TYPE = 'VQA'
DATASET_URL = {...}
DATASET_MD5 = {...}
...
I/O Contract
| Direction | Description |
|---|---|
| Inputs | TSV dataset file with image/video paths and questions |
| Outputs | Evaluation results DataFrame with scores per category |
Usage Examples
from vlmeval.dataset import build_dataset
dataset = build_dataset('SLIDEVQA')
Related Pages
Page Connections
Double-click a node to navigate. Hold to expand connections.
Principle
Implementation
Heuristic
Environment