Implementation:Triton inference server Server L0 Sequence Batcher Test
| Knowledge Sources | |
|---|---|
| Domains | Testing, Sequence_Batching |
| Last Updated | 2026-02-13 17:00 GMT |
Overview
QA test script for validating sequence batcher functionality in the Triton Inference Server.
Description
This test validates the Triton Inference Server's sequence batcher, which manages stateful inference requests that must be routed to the same model instance across a sequence of related calls. It verifies correct handling of sequence start and end flags, correlation ID-based routing, sequence slot management, and timeout behavior for idle sequences. The test covers both the direct and oldest scheduling strategies, exercises concurrent sequences across multiple model instances, and validates that sequence state is properly maintained throughout the lifetime of a sequence.
Usage
Run as part of the Triton QA test suite. Requires a GPU-enabled Docker environment with pre-generated test models.
Code Reference
Source Location
- Repository: Triton Inference Server
- File: qa/L0_sequence_batcher/test.sh
- Lines: 1-923
Signature
#!/bin/bash
source ../common/util.sh
# Test orchestration for sequence batcher functionality
Import
source ../common/util.sh
I/O Contract
Inputs
| Name | Type | Required | Description |
|---|---|---|---|
| DATADIR | env var | No | Test data directory |
| MODEL_REPO | directory | Yes | Test model repository |
Outputs
| Name | Type | Description |
|---|---|---|
| exit code | int | 0 on success, 1 on failure |
| test logs | files | Server and test output logs |
Usage Examples
Running the Test
cd qa/L0_sequence_batcher/
bash test.sh