OpenAPI v1
Text to Speech
Convert text to speech
POST
This endpoint only accepts application/json and application/msgpack.
For best results, upload reference audio with the create model endpoint before calling this one. Doing so improves speech quality and reduces latency.
To upload audio clips directly, without pre-uploading, serialize the request body with MessagePack as per the instructions.
Audio formats supported:
- WAV / PCM
  - Sample Rate: 8kHz, 16kHz, 24kHz, 32kHz, 44.1kHz
  - Default Sample Rate: 44.1kHz
  - 16-bit, mono
- MP3
  - Sample Rate: 32kHz, 44.1kHz
  - Default Sample Rate: 44.1kHz
  - mono
  - Bitrate: 64kbps, 128kbps (default), 192kbps
- Opus
  - Sample Rate: 48kHz
  - Default Sample Rate: 48kHz
  - mono
  - Bitrate: -1000 (auto), 24kbps, 32kbps (default), 48kbps, 64kbps
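The format constraints listed above can be checked on the client before a request is sent. The following is a minimal sketch, not part of the API: the keys `wav`, `mp3`, and `opus`, the table `AUDIO_FORMATS`, and the function `resolve_audio_options` are hypothetical names chosen for illustration, sample rates are written in Hz, and bitrates are written in kbps with `-1000` kept as the "auto" value exactly as listed.

```python
# Supported options per output format, transcribed from the list above.
# The dictionary keys and field names are illustrative assumptions,
# not identifiers taken from the API itself.
AUDIO_FORMATS = {
    "wav": {
        "sample_rates": (8000, 16000, 24000, 32000, 44100),
        "default_sample_rate": 44100,
        "bitrates": (),            # no bitrate setting is listed for WAV / PCM
        "default_bitrate": None,
    },
    "mp3": {
        "sample_rates": (32000, 44100),
        "default_sample_rate": 44100,
        "bitrates": (64, 128, 192),
        "default_bitrate": 128,
    },
    "opus": {
        "sample_rates": (48000,),
        "default_sample_rate": 48000,
        "bitrates": (-1000, 24, 32, 48, 64),  # -1000 means "auto"
        "default_bitrate": 32,
    },
}


def resolve_audio_options(fmt, sample_rate=None, bitrate=None):
    """Fill in defaults and reject combinations the list does not allow."""
    key = fmt.lower()
    if key not in AUDIO_FORMATS:
        raise ValueError(f"unsupported format: {fmt!r}")
    spec = AUDIO_FORMATS[key]

    if sample_rate is None:
        sample_rate = spec["default_sample_rate"]
    elif sample_rate not in spec["sample_rates"]:
        raise ValueError(f"{key} does not support a sample rate of {sample_rate} Hz")

    if bitrate is None:
        bitrate = spec["default_bitrate"]
    elif bitrate not in spec["bitrates"]:
        raise ValueError(f"{key} does not support a bitrate of {bitrate} kbps")

    return {"format": key, "sample_rate": sample_rate, "bitrate": bitrate}
```

For example, `resolve_audio_options("mp3")` fills in the listed defaults (44.1kHz, 128kbps), while `resolve_audio_options("mp3", sample_rate=48000)` raises `ValueError` because 48kHz is listed only for Opus.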
Authorizations
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Headers
Specify which TTS model to use
Available options: speech-1.5, speech-1.6
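As a rough illustration of how the authorization and model headers combine with a JSON body, here is a sketch that builds (but does not send) a request using only the Python standard library. Several details are assumptions because this page does not show them: the URL `https://api.example.com/v1/tts` is a placeholder, the header name `model` is a guess at how the TTS model is selected, and the body field `text` is hypothetical. Check each against the actual schema before relying on it.

```python
import json
import urllib.request

# Placeholder values: the page does not show the endpoint URL, the name of
# the model header, or the body schema, so all three are assumptions.
TTS_URL = "https://api.example.com/v1/tts"
MODEL_HEADER = "model"
ALLOWED_MODELS = ("speech-1.5", "speech-1.6")  # options listed on this page


def build_headers(token, model):
    """Return the request headers for a JSON-encoded call."""
    if model not in ALLOWED_MODELS:
        raise ValueError(f"unknown model: {model!r}")
    return {
        "Authorization": f"Bearer {token}",  # Bearer <token>
        "Content-Type": "application/json",  # one of the two accepted types
        MODEL_HEADER: model,
    }


def build_tts_request(text, token, model, url=TTS_URL):
    """Build a POST request object; nothing is sent over the network here."""
    body = json.dumps({"text": text}).encode("utf-8")  # "text" is a hypothetical field
    return urllib.request.Request(
        url, data=body, headers=build_headers(token, model), method="POST"
    )
```

The resulting object could then be passed to `urllib.request.urlopen`, with the audio read from the response body as bytes. That step is left out here because it needs a real endpoint and token.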
Body
Response
200
Request fulfilled, document follows