speechtotext

There are 76 repositories under speechtotext topic.

  • Azure-Samples/SpeechToText-WebSockets-Javascript

    SDK & Sample to do speech recognition using websockets in Javascript

    Language:TypeScript2193865150
  • damianFC/alexa-rubykit

    Amazon Echo Alexa's App Kit Ruby Implementation

    Language:Ruby15718456
  • hiteshsahu/Android-TTS-STT

    One line solution for Android Text to speech(TTS) & Speech to Text(STT) translation problem

    Language:Kotlin1075940
  • sovoid/Friend.ly

    A social media platform with a friend recommendation engine based on personality trait extraction

    Language:JavaScript533625
  • kaloprojects/KALO-ESP32-Voice-Assistant

    Code snippets showing how to record I2S audio and store as .wav file on ESP32 with SD card, how to transcribe pre-recorded audio via Deepgram SpeechToText (STT) API, how to generate audio from text via TextToSpeech (TTS) API from OpenAI a/o SpeechGen a/o Google TTS. Triggering ESP32 actions via Voice.

    Language:C++28248
  • mrzaizai2k/stock_price_4_fun

    Mrzaizai2k Stock Assistant Bot: Your all-in-one stock analysis companion. Calculate payback time, find support/resistance, and receive market warnings.

    Language:Jupyter Notebook251212
  • microsoft/glue

    GLUE is a lightweight, Python-based collection of scripts to support you at succeeding with speech and text use-cases based on Microsoft Azure Cognitive Services.

    Language:Jupyter Notebook202736
  • AndroidCodility/SpeechToText

    Android application to text through which you can provide speech input to your app using Kotlin Programming Language.

    Language:Kotlin17005
  • sdsb8432/SpeechToText-Android

    Speech to Text with Google for Android

    Language:Java9128
  • ThisIsNSH/Android-CatchMusic

    This project displays full song metadata and lyrics by just inputting a few words from the song. Along with that, they see metadata of other songs in the album and songs from the same singer. Users can also listen to the song directly on Youtube. Not all data shown is free but is fetched for free by using HTML page parsing in Android.

    Language:Java9074
  • Jakevin/Speech-OpenAI

    利用瀏覽器的原生語音輸入與輸出,達到語音對話功能。ブラウザのネイティブ音声入出力を利用し、音声による会話を実現。

    Language:Vue6109
  • olololoe110399/mikasa_gpt

    🚀 MiksaGPT, part of the 'Miksa' project, is a groundbreaking voice assistant utilizing Claude 3 and APIs from 'anthropic' and 'elevenlabs'. It enables real-time Opus two-way voice chat with seamless interruptibility, built with Flutter and available for free on GitHub.

    Language:Dart6102
  • Excalib88/AudioTextR

    🗣 🎤 ✉️ Repo which convert audio messages to text (for example telegram audio messages etc)

    Language:C#5100
  • kaloprojects/KALO-ESP32-Voice-ChatGPT

    ESP32-based Open AI Voice chat device (similar ChatGPT). Recording questions with a microphone, transcribing via Deepgram STT, then sent to Open AI. Response is played with AI voices on speaker. Supporting ongoing dialogues with saved history for follow-up questions. User defined "system prompts" for own "personalities" and dedicated use cases.

    Language:C++5210
  • pranayjoshi/speech_to_text

    This is a speech_to_text script by Pranay Joshi

    Language:Python5201
  • ArshTiwari2004/Recap

    Your smart companion for smarter learning

    Language:JavaScript4003
  • Ashishkumar-hub/Speech-to-text-using-speech_recognition-

    In this project our goal is to acheive the problem of converting audio data into textual data.

    Language:HTML410
  • Ashot72/Speech-to-Text-to-Image

    Generating texts from your voice then images form the texts

    Language:JavaScript4210
  • darshanc99/TranslateThis

    TranslateThis: Android App developed for MCAN Lab

    Language:Java4100
  • keiffster/talk-y

    A speech to text and text to speech client for Program-Y chatbot framework

    Language:Python4410
  • VibhinnS/Doraemon

    A web based platform that lets you control your DJI Tello drone via speech inputs and gestures through UDP protocol. Utilises computer vision, networking, mediapipe and ANNs

    Language:Python4103
  • 0xBitBuster/talking-chatgpt-extension

    ChatGPT Speech-To-Text Extension using Javascript

    Language:JavaScript3100
  • LazarenkoA/TelegramVoiceToText

    Преобразование голосовых сообщений telegram (личных, не в чатах) в текст

    Language:Go310
  • MANISH007700/IBM_Watson_Speech_to_Text

    Leveraging the services of IBM Watson to convert the Real-Time Input Speech to Text

    Language:Python3110
  • umair13adil/background_tts_stt

    A flutter project to run speech recognition service in background.

    Language:Java3101
  • Amanbig/meetings_app

    This project is a Meetings Summarizer built with FastAPI on the backend and a React frontend. It allows users to upload video or audio files, transcribe content, summarize it, ask contextual questions, and convert summaries to audio.

    Language:JavaScript21
  • aniruddhaadak9/VoiceMath

    Voice Math: An interactive and engaging math quiz application designed to improve mental math skills 🧠.

    Language:TypeScript2100
  • App-Lobby/StoryListener

    I were created this application to learn and practice the handling AVFoundation and Speech Frameworks. Its build in Swift 5 with SwiftUI.

    Language:Swift2100
  • ArkS0001/IIT-Bombay-Whisper-Hindi-ASR-Model-Machine-Learning-Intern

    Whisper is an automatic speech recognition (ASR) system trained on 680,000 hours of multilingual and multitask supervised data collected from the web. We show that the use of such a large and diverse dataset leads to improved robustness to accents, background noise and technical language. Moreover, it enables transcription in multiple languages, as

    Language:Jupyter Notebook2103
  • development-community/Integrate-GPT-on-App

    Source code of "ChatGPT at your service" on "ImagineAndMake" serie

    Language:Python2000
  • LazarenkoA/SpeechToTxt

    аудио сообщения в текст, используется API yandex. Speech to text, using the yandex API

    Language:Go210
  • OctaneDevInd/ProjectZia

    An experimental application that enables hearing and speech impaired people to communicate through sign language.

    Language:C#2100
  • Sommie09/HydraMail

    This is a voice-based email application that enables the visually impaired send mails with just voice commands. Implemented using TTS and STT libraries, Android Accessibility Features and SMPT Libraries

    Language:Java2212
  • tauseeqq/Carsoulai

    Description : 'CarSoul-AI' is a Python program blending vehicle diagnostics with integrated ChatGPT. Using voice commands, access real-time car data, clear error codes, or talk to gpt-4 powerd ai for solutions..

    Language:Python2200
  • YashPimpalkar/ModernDictionary

    This is Modern Dictinary using web Scraping with django it gives meaning ,synonyms ,antonyms ,example of a searched word it also has chatbot ,jasascript games ,blogs with google translator api

    Language:HTML2100
  • FGonzalesc/Transcripcion_AI

    Transcripción de audios con Azure Speech y extracción de insights con Open AI

    Language:Python1