Implementation:Mit_han_lab_Llm_awq_Apply_awq
Overview
Concrete tool from the llm-awq library for applying precomputed AWQ transforms (per-channel scales and weight-clipping thresholds) to a model.
Source
File: awq/quantize/pre_quant.py, Lines: 252-254
Signature
def apply_awq(model, awq_results):
    apply_scale(model, awq_results["scale"])
    apply_clip(model, awq_results["clip"])
Import
from awq.quantize.pre_quant import apply_awq
I/O
Inputs:
- model (nn.Module) - FP16 model
- awq_results (dict) - dictionary with "scale" and "clip" keys, loaded via torch.load
Output:
- None (model modified in-place)
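The two-pass, in-place pattern above can be illustrated with a minimal, dependency-free sketch. The `TinyLinear` class and the scale/clip arithmetic here are simplified stand-ins, not the real llm-awq implementations (which operate on `nn.Module` layers); only the call structure mirrors `apply_awq`.

```python
class TinyLinear:
    """Stand-in for an nn.Linear layer holding a weight vector."""
    def __init__(self, weight):
        self.weight = weight  # plain list of floats for illustration

def apply_scale(model, scales):
    # Schematic: divide each weight by its precomputed scale.
    model.weight = [w / s for w, s in zip(model.weight, scales)]

def apply_clip(model, clips):
    # Schematic: clamp each weight to its clipping threshold.
    model.weight = [max(-c, min(c, w)) for w, c in zip(model.weight, clips)]

def apply_awq(model, awq_results):
    # Mirrors the real function: two in-place passes, no return value.
    apply_scale(model, awq_results["scale"])
    apply_clip(model, awq_results["clip"])

# awq_results would normally come from torch.load; here it is hand-built.
model = TinyLinear([4.0, -6.0, 1.0])
awq_results = {"scale": [2.0, 2.0, 2.0], "clip": [1.5, 1.5, 1.5]}
apply_awq(model, awq_results)
print(model.weight)  # [1.5, -1.5, 0.5]
```

Note that `apply_awq` returns `None`; callers rely on the mutation of `model` itself, which is why the real function is invoked on an FP16 model before quantization rather than producing a new model object.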
Related Pages
- Principle:Mit_han_lab_Llm_awq_AWQ_Transform_Application
- Environment:Mit_han_lab_Llm_awq_Python_Runtime_Environment
- Environment:Mit_han_lab_Llm_awq_VILA_Multimodal_Environment
Knowledge Sources
- Repo|llm-awq|https://github.com/mit-han-lab/llm-awq
Domains
- Quantization
- NLP