API Reference
Text to Speech
Convert text into natural-sounding audio
POST
Text to Speech
Generate high-quality audio from text using our neural TTS models. This endpoint returns a complete MP3 file after processing your text.
Request
Here’s the simplest way to generate speech:Parameters
Bearer token:
Bearer sk-your-api-keyThe text to convert to speech (max 5,000 characters)
Voice identifier like
lyra, kai, or zara. See all voicesAudio model:
aurora-4 (premium), aurora-3.5 (high quality), turbo-3 (fast), or mini-2 (lightweight)Adjust voice pitch from
-50% to +50%. Default: +0%Speech emotion:
calm, excited, cheerful, or sad. Default: calmStyle intensity from
0.5 to 2.0. Default: 1.5Language code like
en-US, en-GB, fr-FR. Default: en-USResponse
Returns binary MP3 audio data with these headers:Common patterns
Save to file
Add emotion
Handle errors
Text to Speech

