File tree Expand file tree Collapse file tree 1 file changed +17
-1
lines changed
Expand file tree Collapse file tree 1 file changed +17
-1
lines changed Original file line number Diff line number Diff line change @@ -91,6 +91,22 @@ All source languages can be translated into any target language.
9191 </Columns >
9292</Accordion >
9393
94+ ## Supported Audio Formats
95+
96+ The API supports various common combinations of streaming codecs and containers with a single channel (mono) audio stream.
97+ For a detailed list, please refer to
98+ [ Source Media Content Type] ( /api-reference/voice/get-streaming-url#body-source-media-content-type ) .
99+
100+ | Audio Codec | Audio Container | Recommended Bitrate |
101+ | :--- | :--- | :--- |
102+ | ** PCM** <Icon icon = " star" /> | ** -** | ** 256 kbps (16kHz), default recommendation** |
103+ | ** OPUS** <Icon icon = " star" /> | ** Matroska / Ogg / WebM** | ** 32 kbps, recommended for low bandwidth scenarios** |
104+ | AAC | Matroska | 96 kbps |
105+ | FLAC | FLAC / Matroska / Ogg | 256 kbps (16kHz) |
106+ | MP3 | Matroska / MPEG | 128 kbps |
107+
108+
109+
94110## Two-Step API Flow
95111
96112The Voice API uses a two-step flow to initiate streaming.
@@ -170,7 +186,7 @@ sequenceDiagram
170186 * Receiving transcriptions and translations in real-time
171187
172188 <Note >
173- Once a WebSocket connection is established, you must send audio data to prevent connection closure.
189+ Once a WebSocket connection is established, you must send audio data to prevent connection closure within 30s .
174190 </Note >
175191
176192 See the [ WebSocket Streaming] ( /api-reference/voice/websocket-streaming ) documentation for details.
You can’t perform that action at this time.
0 commit comments