Implementation:SeldonIO Seldon core Seldon Model Load HuggingFace

Field	Value
Type	External Tool Doc
Overview	Concrete CLI tools for deploying and verifying HuggingFace models on Seldon Core 2.
Source	`samples/huggingface.md:L24-40`, `docs-gb/cli/seldon_model_load.md:L1-30`
Domains	MLOps, NLP, Kubernetes
Implements Principle	SeldonIO_Seldon_core_HuggingFace_Model_Deployment_And_Verification
External Dependencies	seldon CLI, kubectl, MLServer with HuggingFace runtime
Knowledge Sources	Repo (https://github.com/SeldonIO/seldon-core), Doc (https://docs.seldon.io/projects/seldon-core/en/v2/)
Last Updated	2026-02-13 00:00 GMT

Code Reference

Loading a HuggingFace Model

seldon model load -f hf-sentiment.yaml

Waiting for Model Availability

seldon model status sentiment -w ModelAvailable

Combined Load and Wait

seldon model load -f hf-sentiment.yaml && seldon model status sentiment -w ModelAvailable

Key Parameters

Parameter	Flag	Description
file-path	`-f` / `--file-path`	Path to the HuggingFace Model CRD YAML manifest
wait condition	`-w` / `--wait`	Condition to wait for; use `"ModelAvailable"` for readiness confirmation
model name	positional argument	Model name as declared in `metadata.name` of the Model CRD (used with `seldon model status`)

I/O Contract

Inputs

Input	Format	Description
HuggingFace Model CRD YAML	YAML file	Model manifest with `requirements: ["huggingface"]` and a `storageUri` pointing to serialized artifacts

Outputs

Output	Format	Description
HuggingFace model loaded on MLServer	Runtime state	The model is deployed on an MLServer instance with the HuggingFace runtime and is ready to accept inference requests
Model status JSON	JSON	Status output confirming the model condition (e.g., `ModelAvailable`)

Usage Examples

Deploying the sentiment analysis model

seldon model load -f samples/models/hf-sentiment.yaml
seldon model status sentiment -w ModelAvailable

Deploying the text generation model

seldon model load -f samples/models/hf-text-gen.yaml
seldon model status text-gen -w ModelAvailable

Deploying the Whisper speech-to-text model

seldon model load -f samples/models/hf-whisper.yaml
seldon model status whisper -w ModelAvailable

Deploying all HuggingFace models for a pipeline

When preparing models for a multi-modal pipeline, deploy all required models sequentially:

seldon model load -f samples/models/hf-whisper.yaml
seldon model status whisper -w ModelAvailable

seldon model load -f samples/models/hf-sentiment.yaml
seldon model status sentiment -w ModelAvailable

seldon model load -f samples/models/hf-text-gen.yaml
seldon model status text-gen -w ModelAvailable

Checking model status without waiting

seldon model status sentiment

This returns the current status JSON without blocking, useful for monitoring or scripting.

Related Pages

SeldonIO_Seldon_core_HuggingFace_Model_Deployment_And_Verification -- principle that this implementation realizes
SeldonIO_Seldon_core_Seldon_Model_CRD_HuggingFace -- depends on the Model CRD YAML manifest that is loaded
SeldonIO_Seldon_core_Seldon_Model_Infer_BYTES -- enables inference requests after the model is deployed and verified
SeldonIO_Seldon_core_Seldon_Model_Load -- specializes the general model load command for HuggingFace models
Environment:SeldonIO_Seldon_core_GPU_Inference_Environment

Page Connections

Double-click a node to navigate. Hold to expand connections.

Principle

Implementation

Heuristic

Environment