Jump to content

Connect SuperML | Leeroopedia MCP: Equip your AI agents with best practices, code verification, and debugging knowledge. Powered by Leeroo — building Organizational Superintelligence. Contact us at founders@leeroo.com.

Implementation:Open compass VLMEvalKit PaliGemma

From Leeroopedia
Field Value
source VLMEvalKit
domain Vision, Model_Architecture

Overview

VLM adapter for the PaliGemma model enabling benchmark evaluation in VLMEvalKit.

Description

PaliGemma inherits from BaseModel and wraps the PaliGemma model for use within the VLMEvalKit evaluation framework. It initializes the model and tokenizer/processor from a HuggingFace model path (default: google/paligemma-3b-mix-448) and provides the generate_inner method for inference. Also includes Gemma3 adapter class for the Gemma 3 model family.

Usage

Register in vlmeval/config.py via supported_VLM and invoke through the standard evaluation pipeline.

Code Reference

  • Source: vlmeval/vlm/gemma.py, Lines: L1-229
  • Import: from vlmeval.vlm.gemma import PaliGemma

Signature:

class PaliGemma(BaseModel):
    INSTALL_REQ = False
    INTERLEAVE = False
    def __init__(self, model_path='google/paligemma-3b-mix-448', **kwargs): ...
    def generate_inner(self, message, dataset=None): ...

I/O Contract

Direction Description
Inputs message — list of dicts with type (text/image) and value; dataset — optional dataset name for custom prompting
Outputs generate_inner() returns str (model response text)

Usage Examples

from vlmeval.vlm.gemma import PaliGemma
model = PaliGemma(model_path='path/to/model')
response = model.generate_inner(message)

Related Pages

Page Connections

Double-click a node to navigate. Hold to expand connections.
Principle
Implementation
Heuristic
Environment