Jump to content

Connect SuperML | Leeroopedia MCP: Equip your AI agents with best practices, code verification, and debugging knowledge. Powered by Leeroo — building Organizational Superintelligence. Contact us at founders@leeroo.com.

Implementation:Elevenlabs Elevenlabs python TtsConversationalConfigInput

From Leeroopedia
Field Value
source Elevenlabs_Elevenlabs_python
domains Conversational AI, Text-to-Speech, Voice Configuration, Audio
last_updated 2026-02-15

Overview

Description

TtsConversationalConfigInput is a Pydantic model that defines the Text-to-Speech (TTS) configuration for a conversational AI agent in the ElevenLabs platform. It provides comprehensive control over voice synthesis parameters including the TTS model, voice selection, audio format, streaming latency optimization, speech stability, speed, similarity boost, text normalization, and pronunciation dictionaries.

This model is auto-generated by Fern from the ElevenLabs API definition and inherits from UncheckedBaseModel. It is a key component in controlling the audio output quality and characteristics of conversational agents.

Usage

TtsConversationalConfigInput is used within the conversational configuration of an agent to define how the agent's speech is synthesized. It allows developers to select voices, tune voice parameters, configure audio formats for different deployment scenarios, and optimize for streaming latency.

Code Reference

Source Location

src/elevenlabs/types/tts_conversational_config_input.py

Class Signature

class TtsConversationalConfigInput(UncheckedBaseModel):
    ...

Import Statement

from elevenlabs.types import TtsConversationalConfigInput

I/O Contract

Field Type Required Description
model_id Optional[TtsConversationalModel] No The model to use for TTS.
voice_id Optional[str] No The voice ID to use for TTS.
supported_voices Optional[List[SupportedVoice]] No Additional supported voices for the agent.
suggested_audio_tags Optional[List[SuggestedAudioTag]] No Suggested audio tags to boost expressive speech (for eleven_v3 and eleven_v3_conversational models). The agent can still use other tags not listed here.
agent_output_audio_format Optional[TtsOutputFormat] No The audio format to use for TTS.
optimize_streaming_latency Optional[TtsOptimizeStreamingLatency] No The optimization for streaming latency.
stability Optional[float] No The stability of generated speech.
speed Optional[float] No The speed of generated speech.
similarity_boost Optional[float] No The similarity boost for generated speech.
text_normalisation_type Optional[TextNormalisationType] No Method for converting numbers to words before TTS. SYSTEM_PROMPT updates the system prompt with normalization instructions. ELEVENLABS normalizes text after generation with slight additional latency.
pronunciation_dictionary_locators Optional[List[PydanticPronunciationDictionaryVersionLocator]] No The pronunciation dictionary locators.

Usage Examples

Basic TTS Configuration

from elevenlabs.types import TtsConversationalConfigInput

tts_config = TtsConversationalConfigInput(
    voice_id="voice_abc123",
    stability=0.5,
    similarity_boost=0.75,
    speed=1.0,
)

Advanced TTS with Model and Format Selection

from elevenlabs.types import TtsConversationalConfigInput

tts_config = TtsConversationalConfigInput(
    model_id="eleven_turbo_v2",
    voice_id="voice_abc123",
    agent_output_audio_format="pcm_16000",
    optimize_streaming_latency=3,
    stability=0.6,
    similarity_boost=0.8,
    speed=1.1,
)

Multi-Voice Agent Configuration

from elevenlabs.types import TtsConversationalConfigInput, SupportedVoice

tts_config = TtsConversationalConfigInput(
    voice_id="primary_voice_id",
    supported_voices=[
        SupportedVoice(voice_id="voice_en_us", language="en"),
        SupportedVoice(voice_id="voice_es_mx", language="es"),
    ],
    stability=0.5,
    similarity_boost=0.75,
)

Related Pages

Page Connections

Double-click a node to navigate. Hold to expand connections.
Principle
Implementation
Heuristic
Environment