Create speech
Synthesizes audio from the input text. Returns a raw audio bytestream in the requested format (e.g. mp3, pcm, wav).
Authorizations
API key as bearer token in Authorization header
Body
Text-to-speech request input
Text to synthesize
"Hello world"
TTS model identifier
"elevenlabs/eleven-turbo-v2"
Voice identifier (provider-specific).
"alloy"
Provider-specific passthrough configuration
Audio output format
mp3, pcm "pcm"
Playback speed multiplier. Only used by models that support it (e.g. OpenAI TTS). Ignored by other providers.
1
Response
Audio bytes stream
Raw audio bytestream. Content-Type varies by requested format (audio/mpeg for mp3, audio/pcm for pcm — 16-bit little-endian).