[RFC] 014 - TTS & STT 二期 #480
Replies: 3 comments 1 reply
This comment was marked as off-topic.
This comment was marked as off-topic.
-
marked |
Beta Was this translation helpful? Give feedback.
-
Gemini also has audio input and speech + text output now. Would you consider adding those features? Because of Gemini 2.5 Pro, Gemini has become my default model provider. https://ai.google.dev/api/live If you do implement the voice feature for OpenAI and Gemini, please add a 'hold to speak' button too. One of the most frustrating things with Gemini Mobile on Android is that you can't hold the audio button down. One of the things I recently loved about using Google AI Studio was that you're able to record multiple voice files and then send them all at once. (Sometimes I record a message, then I get interrupted by my son. I can just click stop and start recording another message when I pick up again.) Then I can send them. It's really convenient, and I love it! |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Beta Was this translation helpful? Give feedback.
All reactions