Environment:Sgl project Sglang CPU Runtime

Sgl_project_Sglang_CPU_Runtime is the CPU inference runtime environment with Intel AMX/AVX-512 SIMD acceleration for SGLang, providing optimized kernels for attention, GEMM, MoE, and other operations on x86 processors.

Requirements

Linux x86_64 with Intel AMX tile support (Xeon 4th Gen+ / Sapphire Rapids)
AVX-512 instruction set support
Python 3.10+
PyTorch (CPU build)
`sgl-kernel` with CPU backend
`sglang-cpu` package
C++ compiler with AVX-512 and AMX intrinsics support
Environment variable `SGLANG_USE_CPU_ENGINE=1`

Required By

Page Connections

Double-click a node to navigate. Hold to expand connections.

Principle

Implementation

Heuristic

Environment

Requirements

Required By

See Also

Page Connections