Implementation:Open compass VLMEvalKit OmniLMM12B
| Field | Value |
|---|---|
| source | VLMEvalKit |
| domain | Vision, Model_Architecture |
Overview
VLM adapter for the OmniLMM-12B model enabling benchmark evaluation in VLMEvalKit.
Description
OmniLMM12B inherits from BaseModel and wraps the OmniLMM-12B model for use within the VLMEvalKit evaluation framework. It initializes the model and tokenizer/processor from a HuggingFace model path (default: model_path, root (required)) and provides the generate_inner method for inference. Uses a custom model initialization function with OmniLMMForCausalLM and separate image processor.
Usage
Register in vlmeval/config.py via supported_VLM and invoke through the standard evaluation pipeline.
Code Reference
- Source:
vlmeval/vlm/omnilmm.py, Lines: L1-183 - Import:
from vlmeval.vlm.omnilmm import OmniLMM12B
Signature:
class OmniLMM12B(BaseModel):
INSTALL_REQ = True
INTERLEAVE = False
def __init__(self, model_path='model_path, root (required)', **kwargs): ...
def generate_inner(self, message, dataset=None): ...
I/O Contract
| Direction | Description |
|---|---|
| Inputs | message — list of dicts with type (text/image) and value; dataset — optional dataset name for custom prompting |
| Outputs | generate_inner() returns str (model response text) |
Usage Examples
from vlmeval.vlm.omnilmm import OmniLMM12B
model = OmniLMM12B(model_path='path/to/model')
response = model.generate_inner(message)