cURL
curl --request POST \
  --url https://api.fish.audio/v1/asr \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: multipart/form-data' \
  --form audio='@example-file' \
  --form 'language=<string>' \
  --form ignore_timestamps=true
{
  "text": "<string>",
  "duration": 123,
  "segments": [
    {
      "text": "<string>",
      "start": 123,
      "end": 123
    }
  ]
}
Transcribe audio to text
multipart/form-data
application/msgpack
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Audio to be converted to text
Language to be used for the speech
Whether to return precise timestamps in the text. Returning them increases latency for audio shorter than 30 seconds.
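For readers who prefer Python to cURL, the request described above can be sketched with the standard library alone. This is a minimal sketch, not an official client: the helper name build_asr_request, the default file name audio.wav, and the application/octet-stream part type are assumptions made for illustration. Only the URL, the Bearer header, and the form fields audio, language, and ignore_timestamps come from this page.

```python
import urllib.request
import uuid

ASR_URL = "https://api.fish.audio/v1/asr"  # endpoint taken from the cURL example


def build_asr_request(audio_bytes, token, filename="audio.wav",
                      language=None, ignore_timestamps=True):
    """Build (but do not send) a multipart/form-data POST for the ASR endpoint.

    Hypothetical helper: the name, defaults, and part content type are
    illustrative assumptions, not part of the documented API.
    """
    boundary = uuid.uuid4().hex

    # Plain text form fields; language is only sent when provided.
    fields = {"ignore_timestamps": "true" if ignore_timestamps else "false"}
    if language is not None:
        fields["language"] = language

    parts = []
    for name, value in fields.items():
        parts.append(
            (f"--{boundary}\r\n"
             f'Content-Disposition: form-data; name="{name}"\r\n\r\n'
             f"{value}\r\n").encode("utf-8")
        )

    # The audio file part, sent as raw bytes.
    parts.append(
        (f"--{boundary}\r\n"
         f'Content-Disposition: form-data; name="audio"; filename="{filename}"\r\n'
         "Content-Type: application/octet-stream\r\n\r\n").encode("utf-8")
        + audio_bytes + b"\r\n"
    )
    parts.append(f"--{boundary}--\r\n".encode("utf-8"))

    return urllib.request.Request(
        ASR_URL,
        data=b"".join(parts),
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": f"multipart/form-data; boundary={boundary}",
        },
    )
```

To send the request, pass the returned object to urllib.request.urlopen and decode the body with json.loads. The accepted values for language are not listed on this page, so any code passed there (for example "en") is an assumption to verify against the service.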
Request fulfilled, document follows
Duration of the audio in seconds
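Once the response JSON has been decoded, the segments array can be turned into readable lines. The sketch below assumes the response shape shown in the sample at the top of this page. The function name format_segments and the sample values are invented for illustration, and treating start and end as seconds is an assumption: this page only states the unit for duration.

```python
import json


def format_segments(response):
    """Return one '[start - end] text' line per segment of an ASR response.

    Hypothetical helper; assumes 'start' and 'end' are numeric offsets in
    seconds, which this page does not state explicitly.
    """
    lines = []
    for seg in response.get("segments", []):
        lines.append(f"[{seg['start']:.2f}s - {seg['end']:.2f}s] {seg['text']}")
    return lines


# Illustrative payload with made-up values, shaped like the documented response.
sample = json.loads(
    '{"text": "hello world", "duration": 2.5, "segments": ['
    '{"text": "hello", "start": 0.0, "end": 1.0}, '
    '{"text": "world", "start": 1.0, "end": 2.5}]}'
)

for line in format_segments(sample):
    print(line)
```

Using response.get("segments", []) keeps the helper from failing if a response carries no segments.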