Principle:Ggml org Llama cpp Quantization
Appearance
| Knowledge Sources | Domains | Last Updated |
|---|---|---|
| ggml-org/llama.cpp | Model Quantization, Compression | 2026-02-15 |
Overview
Description
Quantization is a design principle in the llama.cpp project covering model quantization and compression.
Usage
See linked implementation pages for concrete usage details.
Related Pages
Page Connections
Double-click a node to navigate. Hold to expand connections.
Principle
Implementation
Heuristic
Environment