Implementation:SeldonIO Seldon core Seldon Model Load HuggingFace
Appearance
| Field | Value |
|---|---|
| Type | External Tool Doc |
| Overview | Concrete CLI tools for deploying and verifying HuggingFace models on Seldon Core 2. |
| Source | samples/huggingface.md:L24-40, docs-gb/cli/seldon_model_load.md:L1-30
|
| Domains | MLOps, NLP, Kubernetes |
| Implements Principle | SeldonIO_Seldon_core_HuggingFace_Model_Deployment_And_Verification |
| External Dependencies | seldon CLI, kubectl, MLServer with HuggingFace runtime |
| Knowledge Sources | Repo (https://github.com/SeldonIO/seldon-core), Doc (https://docs.seldon.io/projects/seldon-core/en/v2/) |
| Last Updated | 2026-02-13 00:00 GMT |
Code Reference
Loading a HuggingFace Model
seldon model load -f hf-sentiment.yaml
Waiting for Model Availability
seldon model status sentiment -w ModelAvailable
Combined Load and Wait
seldon model load -f hf-sentiment.yaml && seldon model status sentiment -w ModelAvailable
Key Parameters
| Parameter | Flag | Description |
|---|---|---|
| file-path | -f / --file-path |
Path to the HuggingFace Model CRD YAML manifest |
| wait condition | -w / --wait |
Condition to wait for; use "ModelAvailable" for readiness confirmation
|
| model name | positional argument | Model name as declared in metadata.name of the Model CRD (used with seldon model status)
|
I/O Contract
Inputs
| Input | Format | Description |
|---|---|---|
| HuggingFace Model CRD YAML | YAML file | Model manifest with requirements: ["huggingface"] and a storageUri pointing to serialized artifacts
|
Outputs
| Output | Format | Description |
|---|---|---|
| HuggingFace model loaded on MLServer | Runtime state | The model is deployed on an MLServer instance with the HuggingFace runtime and is ready to accept inference requests |
| Model status JSON | JSON | Status output confirming the model condition (e.g., ModelAvailable)
|
Usage Examples
Deploying the sentiment analysis model
seldon model load -f samples/models/hf-sentiment.yaml
seldon model status sentiment -w ModelAvailable
Deploying the text generation model
seldon model load -f samples/models/hf-text-gen.yaml
seldon model status text-gen -w ModelAvailable
Deploying the Whisper speech-to-text model
seldon model load -f samples/models/hf-whisper.yaml
seldon model status whisper -w ModelAvailable
Deploying all HuggingFace models for a pipeline
When preparing models for a multi-modal pipeline, deploy all required models sequentially:
seldon model load -f samples/models/hf-whisper.yaml
seldon model status whisper -w ModelAvailable
seldon model load -f samples/models/hf-sentiment.yaml
seldon model status sentiment -w ModelAvailable
seldon model load -f samples/models/hf-text-gen.yaml
seldon model status text-gen -w ModelAvailable
Checking model status without waiting
seldon model status sentiment
This returns the current status JSON without blocking, useful for monitoring or scripting.
Related Pages
- SeldonIO_Seldon_core_HuggingFace_Model_Deployment_And_Verification -- principle that this implementation realizes
- SeldonIO_Seldon_core_Seldon_Model_CRD_HuggingFace -- depends on the Model CRD YAML manifest that is loaded
- SeldonIO_Seldon_core_Seldon_Model_Infer_BYTES -- enables inference requests after the model is deployed and verified
- SeldonIO_Seldon_core_Seldon_Model_Load -- specializes the general model load command for HuggingFace models
- Environment:SeldonIO_Seldon_core_GPU_Inference_Environment
Page Connections
Double-click a node to navigate. Hold to expand connections.
Principle
Implementation
Heuristic
Environment