Principle: OpenGVLab InternVL LoRA Adapter Injection
| Knowledge Sources | |
|---|---|
| Domains | Parameter_Efficient_Finetuning, Deep_Learning, NLP |
| Last Updated | 2026-02-07 00:00 GMT |
Overview
A parameter-efficient fine-tuning technique that injects low-rank adapter matrices into pretrained model layers, enabling training with a fraction of the full parameter count.
Description
Low-Rank Adaptation (LoRA) is a parameter-efficient fine-tuning method that freezes the pretrained model weights and injects trainable low-rank decomposition matrices into specific layers. Instead of updating the full weight matrix $W \in \mathbb{R}^{d \times k}$, LoRA adds a parallel path $\Delta W = BA$, where $B \in \mathbb{R}^{d \times r}$ and $A \in \mathbb{R}^{r \times k}$ with rank $r \ll \min(d, k)$.
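The parallel low-rank path can be sketched in a few lines of NumPy. This is a minimal illustration, not InternVL's implementation; the dimensions are hypothetical, and it uses the common initialization where $B$ starts at zero so the adapter initially leaves the frozen output unchanged:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical dimensions; alpha = 2 * r follows the InternVL convention below.
d, k, r, alpha = 64, 64, 16, 32

W0 = rng.standard_normal((d, k))        # frozen pretrained weight (never updated)
A = rng.standard_normal((r, k)) * 0.01  # trainable low-rank factor
B = np.zeros((d, r))                    # trainable, zero-initialized so BA = 0 at start

x = rng.standard_normal(k)

# LoRA forward pass: frozen path plus scaled low-rank path
h = W0 @ x + (alpha / r) * (B @ (A @ x))

# With B initialized to zero, the adapted output equals the frozen output.
assert np.allclose(h, W0 @ x)
```

Because only $A$ and $B$ receive gradients, training touches $r(d+k)$ parameters per layer instead of $dk$.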
In InternVL, LoRA can be applied to:
- Language model (LLM): Adapts attention and MLP layers of the LLM backbone
- Vision encoder (ViT): Adapts attention and MLP layers of InternViT (less common)
The target modules are automatically selected based on the LLM architecture:
- InternLM2: `attention.wqkv`, `attention.wo`, `feed_forward.w1`/`w2`/`w3`
- Qwen2/LLaMA: `self_attn.q_proj`/`k_proj`/`v_proj`/`o_proj`, `mlp.gate_proj`/`down_proj`/`up_proj`
Usage
Use LoRA when fine-tuning InternVL on custom datasets with limited GPU memory, or when you want to keep the base model weights unchanged so that multiple task-specific adapters can share a single base model.
Theoretical Basis
The LoRA update rule:

$$h = W_0 x + \frac{\alpha}{r} B A x$$

Where:
- $W_0 \in \mathbb{R}^{d \times k}$ is the frozen pretrained weight
- $B \in \mathbb{R}^{d \times r}$, $A \in \mathbb{R}^{r \times k}$ are trainable
- $r$ is the rank (typical: 16)
- $\alpha$ is the scaling factor (convention in InternVL: $\alpha = 2r$)
The trainable parameter count for one LoRA layer is $r \times (d + k)$, compared to $d \times k$ for full fine-tuning.
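A worked instance of this count, using a hypothetical $4096 \times 4096$ projection (typical of a ~7B-parameter LLM) and $r = 16$:

```python
# Parameter count for one LoRA layer vs. full fine-tuning of the same layer.
d, k, r = 4096, 4096, 16  # hypothetical projection size and rank

lora_params = r * (d + k)  # trainable entries in B (d x r) and A (r x k)
full_params = d * k        # entries updated by full fine-tuning

print(lora_params)               # 131072
print(full_params)               # 16777216
print(lora_params / full_params) # 0.0078125, i.e. under 1% of the full count
```

Summed over all targeted attention and MLP projections, the adapter typically stays in the low single-digit percent of the backbone's parameters.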
InternVL convention:
- lora_alpha = 2 * r (scaling factor)
- lora_dropout = 0.05 (dropout on LoRA path)
- All base model parameters frozen; only LoRA matrices and optionally MLP projector are trainable