speechtotext
There are 76 repositories under the speechtotext topic.
Azure-Samples/SpeechToText-WebSockets-Javascript
SDK & Sample to do speech recognition using websockets in Javascript
damianFC/alexa-rubykit
Amazon Echo Alexa's App Kit Ruby Implementation
hiteshsahu/Android-TTS-STT
One-line solution for the Android Text to Speech (TTS) & Speech to Text (STT) translation problem
sovoid/Friend.ly
A social media platform with a friend recommendation engine based on personality trait extraction
kaloprojects/KALO-ESP32-Voice-Assistant
Code snippets showing how to record I2S audio and store it as a .wav file on an ESP32 with an SD card, how to transcribe pre-recorded audio via the Deepgram SpeechToText (STT) API, and how to generate audio from text via the TextToSpeech (TTS) APIs from OpenAI and/or SpeechGen and/or Google TTS. Triggering ESP32 actions via voice.
mrzaizai2k/stock_price_4_fun
Mrzaizai2k Stock Assistant Bot: Your all-in-one stock analysis companion. Calculate payback time, find support/resistance, and receive market warnings.
microsoft/glue
GLUE is a lightweight, Python-based collection of scripts to help you succeed with speech and text use cases based on Microsoft Azure Cognitive Services.
AndroidCodility/SpeechToText
Android speech-to-text application, written in Kotlin, through which you can provide speech input to your app.
sdsb8432/SpeechToText-Android
Speech to Text with Google for Android
ThisIsNSH/Android-CatchMusic
This project displays full song metadata and lyrics from just a few words of the song. Users also see metadata of other songs in the album and songs by the same singer, and can listen to the song directly on YouTube. Not all of the data shown is freely available; it is fetched at no cost by parsing HTML pages in Android.
Jakevin/Speech-OpenAI
Uses the browser's native speech input and output to enable voice conversation.
olololoe110399/mikasa_gpt
🚀 MiksaGPT, part of the 'Miksa' project, is a groundbreaking voice assistant utilizing Claude 3 and APIs from 'anthropic' and 'elevenlabs'. It enables real-time Opus two-way voice chat with seamless interruptibility, built with Flutter and available for free on GitHub.
Excalib88/AudioTextR
🗣 🎤 ✉️ Repo that converts audio messages to text (for example, Telegram audio messages)
kaloprojects/KALO-ESP32-Voice-ChatGPT
ESP32-based OpenAI voice chat device (similar to ChatGPT). Questions are recorded with a microphone, transcribed via Deepgram STT, then sent to OpenAI. The response is played with AI voices on a speaker. Supports ongoing dialogues with saved history for follow-up questions, and user-defined "system prompts" for custom "personalities" and dedicated use cases.
pranayjoshi/speech_to_text
This is a speech_to_text script by Pranay Joshi
ArshTiwari2004/Recap
Your smart companion for smarter learning
Ashishkumar-hub/Speech-to-text-using-speech_recognition-
In this project, our goal is to solve the problem of converting audio data into text data.
Ashot72/Speech-to-Text-to-Image
Generating text from your voice, then images from the text
darshanc99/TranslateThis
TranslateThis: Android App developed for MCAN Lab
keiffster/talk-y
A speech to text and text to speech client for Program-Y chatbot framework
VibhinnS/Doraemon
A web-based platform that lets you control your DJI Tello drone via speech inputs and gestures over the UDP protocol. Utilises computer vision, networking, MediaPipe and ANNs
0xBitBuster/talking-chatgpt-extension
ChatGPT Speech-To-Text Extension using Javascript
LazarenkoA/TelegramVoiceToText
Converts Telegram voice messages (private ones, not in chats) to text
MANISH007700/IBM_Watson_Speech_to_Text
Leveraging the services of IBM Watson to convert the Real-Time Input Speech to Text
umair13adil/background_tts_stt
A Flutter project to run a speech recognition service in the background.
Amanbig/meetings_app
This project is a Meetings Summarizer built with FastAPI on the backend and a React frontend. It allows users to upload video or audio files, transcribe content, summarize it, ask contextual questions, and convert summaries to audio.
aniruddhaadak9/VoiceMath
Voice Math: An interactive and engaging math quiz application designed to improve mental math skills 🧠.
App-Lobby/StoryListener
I created this application to learn and practice working with the AVFoundation and Speech frameworks. It's built in Swift 5 with SwiftUI.
ArkS0001/IIT-Bombay-Whisper-Hindi-ASR-Model-Machine-Learning-Intern
Whisper is an automatic speech recognition (ASR) system trained on 680,000 hours of multilingual and multitask supervised data collected from the web. We show that the use of such a large and diverse dataset leads to improved robustness to accents, background noise and technical language. Moreover, it enables transcription in multiple languages, as
development-community/Integrate-GPT-on-App
Source code of "ChatGPT at your service" in the "ImagineAndMake" series
LazarenkoA/SpeechToTxt
Audio messages to text, using the Yandex API
OctaneDevInd/ProjectZia
An experimental application that enables hearing and speech impaired people to communicate through sign language.
Sommie09/HydraMail
This is a voice-based email application that enables the visually impaired to send mail with just voice commands. Implemented using TTS and STT libraries, Android Accessibility Features and SMTP libraries
tauseeqq/Carsoulai
'CarSoul-AI' is a Python program blending vehicle diagnostics with integrated ChatGPT. Using voice commands, access real-time car data, clear error codes, or talk to a GPT-4-powered AI for solutions.
YashPimpalkar/ModernDictionary
This is a Modern Dictionary built with Django using web scraping. It gives the meaning, synonyms, antonyms, and examples of a searched word. It also has a chatbot, JavaScript games, and blogs with the Google translator API
FGonzalesc/Transcripcion_AI
Audio transcription with Azure Speech and insight extraction with OpenAI