Commit e242dec

Merge pull request #151 from DeepLcom/voice-add-audio-formats
[ACL-2127] Add audio format summary to voice entry page
2 parents 3b8f518 + 636db10 commit e242dec

1 file changed, +17 -1 lines changed

api-reference/voice.mdx

Lines changed: 17 additions & 1 deletion
@@ -91,6 +91,22 @@ All source languages can be translated into any target language.
 </Columns>
 </Accordion>
 
+## Supported Audio Formats
+
+The API supports various common combinations of streaming codecs and containers with a single channel (mono) audio stream.
+For a detailed list, please refer to
+[Source Media Content Type](/api-reference/voice/get-streaming-url#body-source-media-content-type).
+
+| Audio Codec | Audio Container | Recommended Bitrate |
+| :--- | :--- | :--- |
+| **PCM** <Icon icon="star"/> | **-** | **256 kbps (16kHz), default recommendation** |
+| **OPUS** <Icon icon="star"/> | **Matroska / Ogg / WebM** | **32 kbps, recommended for low bandwidth scenarios** |
+| AAC | Matroska | 96 kbps |
+| FLAC | FLAC / Matroska / Ogg | 256 kbps (16kHz) |
+| MP3 | Matroska / MPEG | 128 kbps |
+
+
+
 ## Two-Step API Flow
 
 The Voice API uses a two-step flow to initiate streaming.
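
A note on the table added in this hunk: the 256 kbps figure listed for PCM at 16kHz matches what uncompressed mono audio would need at 16 bits per sample. The diff does not state the sample width, so 16-bit is an assumption here, and the helper names below are hypothetical. A minimal sketch of the arithmetic:

```python
def pcm_bitrate_kbps(sample_rate_hz: int, bits_per_sample: int = 16, channels: int = 1) -> float:
    """Uncompressed PCM bitrate in kilobits per second (1 kbps = 1000 bit/s).

    bits_per_sample=16 is an assumption; the diff only gives 16kHz and mono.
    """
    return sample_rate_hz * bits_per_sample * channels / 1000


def pcm_chunk_bytes(sample_rate_hz: int, chunk_ms: int, bits_per_sample: int = 16, channels: int = 1) -> int:
    """Number of bytes of raw PCM covering chunk_ms milliseconds of audio."""
    bytes_per_second = sample_rate_hz * (bits_per_sample // 8) * channels
    return bytes_per_second * chunk_ms // 1000


if __name__ == "__main__":
    print(pcm_bitrate_kbps(16_000))      # 16 kHz, 16-bit, mono
    print(pcm_chunk_bytes(16_000, 100))  # bytes in a 100 ms chunk
```

Under those assumptions, 16,000 samples per second times 16 bits times one channel gives 256,000 bit/s, which is the table's 256 kbps.
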
@@ -170,7 +186,7 @@ sequenceDiagram
 * Receiving transcriptions and translations in real-time
 
 <Note>
-Once a WebSocket connection is established, you must send audio data to prevent connection closure.
+Once a WebSocket connection is established, you must send audio data to prevent connection closure within 30s.
 </Note>
 
 See the [WebSocket Streaming](/api-reference/voice/websocket-streaming) documentation for details.
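
A note on the changed `<Note>` line: it reads as a 30-second limit on how long an established connection may go without audio. The diff does not say whether that timer runs only from connection open or resets with each chunk sent, so the sketch below treats it as an idle timer, which is an assumption. `IdleGuard`, its methods, and the 5-second safety margin are hypothetical and not part of the Voice API; the class only tracks timing and leaves the actual WebSocket send to the caller.

```python
import time

IDLE_LIMIT_S = 30.0     # taken from the note in the diff
SAFETY_MARGIN_S = 5.0   # assumption: send early rather than at the limit


class IdleGuard:
    """Tracks time since the last audio send against an idle limit."""

    def __init__(self, limit_s=IDLE_LIMIT_S, margin_s=SAFETY_MARGIN_S, clock=time.monotonic):
        self._budget_s = limit_s - margin_s
        self._clock = clock
        self._last_send = clock()

    def mark_sent(self):
        """Call after each audio chunk is written to the connection."""
        self._last_send = self._clock()

    def seconds_until_due(self):
        """Seconds left before the next chunk should go out (never negative)."""
        elapsed = self._clock() - self._last_send
        return max(0.0, self._budget_s - elapsed)

    def is_due(self):
        """True once the idle budget has been used up."""
        return self.seconds_until_due() == 0.0
```

A sender loop could call `mark_sent()` after every chunk and check `is_due()` whenever the audio source pauses. Using `time.monotonic` as the default clock keeps the guard unaffected by wall-clock adjustments.
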
