API Pricing
The Fish Audio API uses pay-as-you-go pricing based on actual usage. There are no subscription fees or monthly minimums for API access.For most recent pricing information, please visit our pricing page.
Text-to-Speech (TTS) Models
TTS pricing is based on the size of input text, measured in millions of UTF-8 bytes.| Model Name | Price (USD) |
|---|---|
speech-1.5 | $15.00 / M UTF-8 bytes |
speech-1.6 | $15.00 / M UTF-8 bytes |
s1 | $15.00 / M UTF-8 bytes |
1M UTF-8 bytes is approximately 180,000 English words, or about 12 hours of speech
Automatic Speech Recognition (ASR) Models
| Model Name | Price (USD) |
|---|---|
transcribe-1 | $0.36 / audio hour |
- Charges are based on the duration of audio processed
- Duration is rounded up to the nearest second
Rate Limits
These limits help us ensure fair usage and maintain service quality for all users.Concurrent Request Limits
| Tier | Spending Threshold | Concurrent Requests |
|---|---|---|
| Starter | < $100 paid | 5 requests |
| Elevated | ≥ $100 paid | 15 requests |
| Enterprise | Custom | Custom limits |
Please reach out to our team to enable enterprise volume pricing, rate limits, and billing.
Support
Need help? Check out these resources:- API Reference - Complete API documentation
- Create a Voice Clone - Create a voice clone model
- Generate Speech - Generate realistic speech
- Real-time Streaming - WebSocket for real-time streaming
- Discord Community - Get help from the community
- Support Email - Contact our support team

