Jump to content

Connect Leeroopedia MCP: Equip your AI agents to search best practices, build plans, verify code, diagnose failures, and look up hyperparameter defaults.

Environment:Sgl project Sglang CPU Runtime

From Leeroopedia


Sgl_project_Sglang_CPU_Runtime is the CPU inference runtime environment with Intel AMX/AVX-512 SIMD acceleration for SGLang, providing optimized kernels for attention, GEMM, MoE, and other operations on x86 processors.

Requirements

  • Linux x86_64 with Intel AMX tile support (Xeon 4th Gen+ / Sapphire Rapids)
  • AVX-512 instruction set support
  • Python 3.10+
  • PyTorch (CPU build)
  • `sgl-kernel` with CPU backend
  • `sglang-cpu` package
  • C++ compiler with AVX-512 and AMX intrinsics support
  • Environment variable `SGLANG_USE_CPU_ENGINE=1`

Required By

See Also

Page Connections

Double-click a node to navigate. Hold to expand connections.
Principle
Implementation
Heuristic
Environment