Environment:Sgl project Sglang Runtime
Appearance
Sgl_project_Sglang_Runtime is the SGLang runtime server environment, providing the full model serving stack including HTTP API, scheduler, model executor, and tokenizer management.
Requirements
- Python 3.10+
- SGLang package (`sglang[all]`) installed
- PyTorch 2.9.1+
- Transformers 4.57.1+
- `fastapi`, `uvicorn`, `uvloop` for HTTP serving
- GPU or CPU backend configured
- Model weights accessible (local path or HuggingFace Hub)
- `HF_TOKEN` for gated models (optional)
Required By
See Also
Page Connections
Double-click a node to navigate. Hold to expand connections.
Principle
Implementation
Heuristic
Environment