WebSocket Streaming

Prerequisites

Create a Fish Audio account

Go to fish.audio/auth/signup
Fill in your details to create an account, complete steps to verify your account.
Log in to your account and navigate to the API section

Get your API key

Once you have an account, you’ll need an API key to authenticate your requests.

Log in to your Fish Audio Dashboard
Navigate to the API Keys section
Click “Create New Key” and give it a descriptive name, set a expiration if desired
Copy your key and store it securely

Keep your API key secret! Never commit it to version control or share it publicly.

Overview

Use stream_websocket() for real-time text streaming with LLMs and live captions. The connection automatically buffers incoming text and generates audio as it becomes available.

Basic Usage

Stream text chunks and receive audio in real-time:

from fishaudio import FishAudio
from fishaudio.utils import play

client = FishAudio()

# Define text generator
def text_chunks():
    yield "Hello, "
    yield "this is "
    yield "real-time "
    yield "streaming!"

# Stream audio via WebSocket
audio_stream = client.tts.stream_websocket(
    text_chunks(),
    latency="balanced"  # Use "balanced" for real-time, "normal" for quality
)

# Play streamed audio
play(audio_stream)

For details on audio formats, voice selection, and advanced configuration options like TTSConfig, see the Text-to-Speech guide.

Using FlushEvent

Force immediate audio generation to create pauses using FlushEvent:

from fishaudio import FishAudio
from fishaudio.types import FlushEvent

client = FishAudio()

def text_with_flush():
    yield "First sentence. "
    yield "Second sentence. "
    yield FlushEvent()  # Forces generation NOW
    yield "Third sentence."

audio_stream = client.tts.stream_websocket(text_with_flush())

See Text-to-Speech guide for detailed FlushEvent usage and advanced examples.

LLM Integration

WebSocket streaming is designed for integrating with LLM streaming responses. The TTS engine automatically buffers incoming text chunks and generates audio when it has enough context for natural speech:

from fishaudio import FishAudio
from fishaudio.utils import play

client = FishAudio()

# Simulate streaming LLM response
def llm_stream():
    """Simulates text chunks from an LLM."""
    tokens = [
        "The ", "weather ", "today ", "is ", "sunny ",
        "with ", "clear ", "skies. ", "Perfect ",
        "for ", "outdoor ", "activities!"
    ]
    for token in tokens:
        yield token

# Stream to speech in real-time
audio_stream = client.tts.stream_websocket(
    llm_stream(),
    latency="balanced"
)
play(audio_stream)

The WebSocket connection automatically buffers incoming text and generates audio when it has accumulated enough context for natural-sounding speech. You don’t need to manually batch tokens unless you want to force generation at specific points using FlushEvent.

Next Steps

Text-to-Speech

Learn about non-streaming TTS options, audio formats, TextEvent vs plain strings, and advanced configuration

Voice Cloning

Use custom voices in streams and learn about voice selection

TTS API Reference

Complete streaming API documentation

Best Practices

Production streaming optimization

WebSocket Types - TextEvent, FlushEvent, and more
Utils Reference - Audio playback utilities
Error Handling - WebSocket exception handling
Fine-grained Control - Advanced speech control

Getting Started

Models & Pricing

Core Features

Developer SDKs

Best Practices

Product Guides

Self-Hosting

Integrations

Tutorials

Resources

Prerequisites

Overview

Basic Usage

Using FlushEvent

LLM Integration

Next Steps

Text-to-Speech

Voice Cloning

TTS API Reference

Best Practices

Getting Started

Models & Pricing

Core Features

Developer SDKs

Best Practices

Product Guides

Self-Hosting

Integrations

Tutorials

Resources

​Prerequisites

​Overview

​Basic Usage

​Using FlushEvent

​LLM Integration

​Next Steps

Text-to-Speech

Voice Cloning

TTS API Reference

Best Practices

​Related Resources

Prerequisites

Overview

Basic Usage

Using FlushEvent

LLM Integration

Next Steps

Related Resources