Skip to main content

Purpose

This page is the recommended starting point for AI agents, RAG pipelines, and documentation crawlers that need accurate Fish Audio references with minimal markup noise.

Retrieval Order

  1. Read llms.txt for the curated documentation index.
  2. Read llms-full.txt when broad site context is needed.
  3. Read OpenAPI for REST schemas, parameters, and examples.
  4. Read AsyncAPI for the WebSocket streaming protocol.
  5. Fetch individual .md pages only after narrowing to a specific task.

Canonical API Facts

  • Base API URL: https://api.fish.audio
  • Authentication: Authorization: Bearer <FISH_API_KEY>
  • TTS model selection: send a required model header. Recommended default: s2-pro
  • Main REST endpoints:
    • POST /v1/tts
    • POST /v1/asr
    • GET /model
    • POST /model
    • GET /model/{id}
    • PATCH /model/{id}
    • DELETE /model/{id}
  • Real-time streaming endpoint: wss://api.fish.audio/v1/tts/live

High-Value URLs

Start Here

API Specs

Authentication And SDK Setup

Core Product Tasks

Real-Time And Integrations

Models, Pricing, And Lifecycle

Task Routing

  • If the task is “generate speech”, start with Quick Start, the Text to Speech guide, and POST /v1/tts.
  • If the task is “transcribe audio”, start with the Speech to Text guide and POST /v1/asr.
  • If the task is “clone or manage voices”, start with Creating Voice Models and the /model endpoints.
  • If the task is “stream audio in real time”, start with AsyncAPI, WebSocket TTS Streaming, and the WebSocket SDK guides.
  • If the task is “pick the right model or estimate cost”, start with Models Overview and Pricing And Rate Limits.

Notes For Agents

  • Prefer openapi.json and asyncapi.yml for machine-readable schemas.
  • Prefer .md URLs when you need a single human-authored page in Markdown form.
  • Some richer pages use interactive MDX widgets. If a fetched page contains UI or component noise, fall back to this page, llms.txt, llms-full.txt, or the API spec files first.
  • Treat this page as the canonical low-noise entry point for Fish Audio documentation retrieval.