Jump to content

Connect SuperML | Leeroopedia MCP: Equip your AI agents with best practices, code verification, and debugging knowledge. Powered by Leeroo — building Organizational Superintelligence. Contact us at founders@leeroo.com.

Implementation:Kserve Kserve LLMInferenceServiceConfig Minimal CRD

From Leeroopedia
Knowledge Sources
Domains Kubernetes, LLM Inference, CRD
Last Updated 2026-02-13 00:00 GMT

Overview

This file defines the minimal CustomResourceDefinition (CRD) for the LLMInferenceServiceConfig custom resource in the serving.kserve.io API group.

Description

Auto-generated by controller-gen (v0.19.0), this CRD provides a minimal-footprint alternative to the full LLMInferenceServiceConfig CRD. The v1alpha1 version uses x-kubernetes-preserve-unknown-fields to skip detailed field validation, while the v1alpha2 version includes partial schema definitions for fields such as baseRefs, model, and parallelism. The resource is namespaced and supports configuration of LLM inference service templates including model specifications, parallelism settings, and pod templates.

Usage

Use this minimal CRD in development or constrained environments where the full CRD (with complete OpenAPI validation) exceeds size limits or where strict field validation is not required. It is applied to the cluster to register the LLMInferenceServiceConfig resource type before creating any LLMInferenceServiceConfig objects.

Code Reference

Source Location

Signature

apiVersion: apiextensions.k8s.io/v1
kind: CustomResourceDefinition
metadata:
  annotations:
    controller-gen.kubebuilder.io/version: v0.19.0
  name: llminferenceserviceconfigs.serving.kserve.io
spec:
  group: serving.kserve.io
  names:
    kind: LLMInferenceServiceConfig
    listKind: LLMInferenceServiceConfigList
    plural: llminferenceserviceconfigs
    singular: llminferenceserviceconfig
  scope: Namespaced
  versions:
  - name: v1alpha1
    schema:
      openAPIV3Schema:
        properties:
          apiVersion:
            type: string
          kind:
            type: string
          metadata:
            type: object
          spec:
            type: object
            x-kubernetes-map-type: atomic
            x-kubernetes-preserve-unknown-fields: true
        type: object
    served: true
    storage: false
  - name: v1alpha2
    # ... includes partial schema for baseRefs, model, parallelism

Import

kubectl apply -f config/crd/minimal/llmisvc/serving.kserve.io_llminferenceserviceconfigs.yaml

I/O Contract

Field Type Description
apiVersion string Must be apiextensions.k8s.io/v1
kind string Must be CustomResourceDefinition
spec.group string API group: serving.kserve.io
spec.names.kind string LLMInferenceServiceConfig
spec.scope string Namespaced

Versions

Version Storage Served Schema Detail
v1alpha1 false true Minimal schema with x-kubernetes-preserve-unknown-fields
v1alpha2 true true Partial schema with baseRefs, model, parallelism, template

v1alpha2 Key Spec Fields

Field Type Description
spec.baseRefs array References to base configuration objects
spec.model object Model specification with uri (required), name, and lora adapters
spec.parallelism object Parallelism settings: data, dataLocal, dataRPCPort, expert, tensor
spec.template object Pod template specification for containers and init containers

Usage Examples

Register the CRD on a Kubernetes cluster:

# Apply the minimal CRD definition
kubectl apply -f config/crd/minimal/llmisvc/serving.kserve.io_llminferenceserviceconfigs.yaml

# Verify the CRD is registered
kubectl get crd llminferenceserviceconfigs.serving.kserve.io

Related Pages

Page Connections

Double-click a node to navigate. Hold to expand connections.
Principle
Implementation
Heuristic
Environment