FishAudioTTSService, which provides real-time text-to-speech synthesis using WebSocket streaming for low-latency conversational applications.
Prerequisites
- A Fish Audio account with an API key
- Python 3.9 or higher
Installation
Install Pipecat with Fish Audio support:Configuration
Set your Fish Audio API key as an environment variable:Basic usage
AddFishAudioTTSService to your Pipecat pipeline:
Key parameters
| Parameter | Description |
|---|---|
api_key | Your Fish Audio API key |
reference_id | Voice model ID from the Fish Audio library |
model_id | TTS model version (default: s1) |
output_format | Audio format: pcm, mp3, wav, or opus |
Prosody controls
Customize speech characteristics withInputParams:



