Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android
-
Updated
Feb 12, 2025 - C++
Offline Speech Recognition with OpenAI Whisper and TensorFlow Lite for Android
Generate TikTok Text-to-Speech voices in your browser
Text to speech package for Golang.
I will share about Machine Learning and Deep Learning.
Lightweight Gradio based WebUI for orpheusTTS - WSL / Linux [CUDA]
A simple tool to demo text-to-speech using various services' voices. HTML5 and Vanilla JS.
Text to Speech NativeScript plugin for Android & iOS 📢
A ComfyUI node for Maya1, a 3B-parameter speech model built for expressive voice generation with rich human emotion and precise voice design.
This repo is text to speech with learnable audio encoder without alignment with transcript reference
Code snippets showing how to record I2S audio and store as .wav file on ESP32 with SD card, how to transcribe pre-recorded audio via Deepgram SpeechToText (STT) API, how to generate audio from text via TextToSpeech (TTS) API from OpenAI a/o SpeechGen a/o Google TTS. Triggering ESP32 actions via Voice.
The only Text to Speech Telegram Inline Bot
Whooby is a text-to-speech android application to communicate within a group or community .
Text To Speech Demo in ReactJS Application using Azure Avatar AI Service.
ESP32-based voice device for chatting with multiple custom AI bots. Recording questions with I2S microphone, transcribing via ElevenLabs or Deepgram STT, creating response with Groq or Open AI LLM. TTS audio output with custom AI voices via I2S & speaker. Supporting ongoing dialogues, calling bots ‘by name’, real-time web search via keyword.
Simple application for continuous speach to text without google dialog
Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to speech .
Explore AI Capabilities for Your .NET Projects with OpenAI's API: Unlock the power of AI in your applications
ComfyUI nodes for Step Audio EditX - State-of-the-art zero-shot voice cloning and audio editing with emotion, style, speed control, and more.
🦸🏻♂️🎺 Talkify is a comprehensive, cross-platform Swift library for adding advanced speech features to your applications. It efficiently manages voice-to-text and text-to-speech capabilities using the power of AVFoundation and Speech frameworks.
Add a description, image, and links to the texttospeech topic page so that developers can more easily learn about it.
To associate your repository with the texttospeech topic, visit your repo's landing page and select "manage topics."