Principle:Groq Groq python Speech Request Configuration
| Knowledge Sources | |
|---|---|
| Domains | Audio, Text_To_Speech |
| Last Updated | 2026-02-15 16:00 GMT |
Overview
The process of configuring text-to-speech request parameters including input text, voice selection, model choice, and audio output format.
Description
Speech Request Configuration involves preparing the parameters needed for a text-to-speech synthesis request. This includes selecting the TTS model, choosing a voice identity, providing the input text, and optionally configuring the audio output format, sample rate, and speech speed.
Key configuration points:
- Model selection: Choose between available TTS models (e.g., playai-tts, playai-tts-arabic)
- Voice selection: Specify a voice ID that determines the speaker characteristics
- Audio format: Output as flac, mp3, mulaw, ogg, or wav
- Sample rate: Control audio quality from 8000 to 48000 Hz
- Speed: Adjust playback speed of generated speech
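The configuration points above can be checked before a request is sent. The following is a minimal sketch, not part of the Groq SDK: the `SpeechRequestConfig` class and its methods are hypothetical names introduced for illustration. The format list and sample-rate range come from the list above; the service may accept only specific values inside that range, and the "speed must be positive" rule is an assumption.

```python
from dataclasses import dataclass
from typing import Optional

# Values taken from the configuration points above.
ALLOWED_FORMATS = {"flac", "mp3", "mulaw", "ogg", "wav"}
MIN_SAMPLE_RATE, MAX_SAMPLE_RATE = 8000, 48000


@dataclass
class SpeechRequestConfig:
    """Hypothetical container for TTS request parameters (not an SDK type)."""

    input: str
    model: str
    voice: str
    response_format: Optional[str] = None
    sample_rate: Optional[int] = None
    speed: Optional[float] = None

    def validate(self) -> None:
        # The three required parameters must be present and non-empty.
        for name in ("input", "model", "voice"):
            if not getattr(self, name):
                raise ValueError(f"{name} is required and must be non-empty")
        if (self.response_format is not None
                and self.response_format not in ALLOWED_FORMATS):
            raise ValueError(f"unsupported format: {self.response_format!r}")
        if (self.sample_rate is not None
                and not MIN_SAMPLE_RATE <= self.sample_rate <= MAX_SAMPLE_RATE):
            raise ValueError(f"sample_rate out of range: {self.sample_rate}")
        # Assumption: any positive speed is accepted; the real limits may be narrower.
        if self.speed is not None and self.speed <= 0:
            raise ValueError(f"speed must be positive: {self.speed}")

    def to_kwargs(self) -> dict:
        """Return request parameters, leaving out optional values that are unset."""
        self.validate()
        kwargs = {"input": self.input, "model": self.model, "voice": self.voice}
        for name in ("response_format", "sample_rate", "speed"):
            value = getattr(self, name)
            if value is not None:
                kwargs[name] = value
        return kwargs
```

Leaving unset optional fields out of the result lets the service apply its own defaults instead of hard-coding them on the client side.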
Usage
Use this principle when preparing a text-to-speech request. Three parameters are required: input text, model, and voice. Audio format, sample rate, and speed are optional and fall back to sensible defaults when omitted.
Theoretical Basis
# Abstract TTS configuration
config = {
    "input": text_to_speak,     # Required
    "model": tts_model,         # Required
    "voice": voice_id,          # Required
    "response_format": "wav",   # Optional
    "sample_rate": 24000,       # Optional
    "speed": 1.0,               # Optional
}
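A configuration dictionary of this shape can be passed to the client as keyword arguments. The sketch below assumes the `groq` Python package, whose client exposes `audio.speech.create(...)` and whose response offers `write_to_file(...)`. The helper `synthesize_to_file`, the `main` function, the output file name, and the voice ID are placeholders introduced for illustration, not values from this page.

```python
from typing import Any

REQUIRED_KEYS = ("input", "model", "voice")


def synthesize_to_file(client: Any, config: dict, out_path: str) -> str:
    """Send a TTS request built from `config` and write the audio to `out_path`.

    `client` is expected to behave like a Groq client, i.e. to provide
    `client.audio.speech.create(**config)`.
    """
    missing = [key for key in REQUIRED_KEYS if not config.get(key)]
    if missing:
        raise ValueError(f"missing required parameters: {', '.join(missing)}")
    response = client.audio.speech.create(**config)
    response.write_to_file(out_path)
    return out_path


def main() -> None:
    # Imported here so the helper above stays usable without the package installed.
    # Assumes the `groq` package is installed and GROQ_API_KEY is set.
    from groq import Groq

    client = Groq()
    config = {
        "input": "Hello from a text-to-speech request.",
        "model": "playai-tts",
        "voice": "your-voice-id",   # Placeholder: substitute a real voice ID
        "response_format": "wav",   # Optional
    }
    synthesize_to_file(client, config, "speech.wav")
```

Calling `main()` runs the request against the live service. Keeping the configuration as a plain dictionary, separate from the call itself, makes it straightforward to log, validate, or reuse the same settings across requests.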