voice-recognition

There are 1,443 repositories under the voice-recognition topic.

PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
Language: Python 11.3k 185 1.9k 1.9k
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
Language: Python 9.1k 134 1.1k 1.4k
alphacep/vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Language: Jupyter Notebook 8.3k 119 1.5k 1.1k
snakers4/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Language: Python 4.6k 49 250445
theajack/cnchar
🇨🇳 A full-featured Chinese character utility library (pinyin, strokes, radicals, idioms, speech, visualization, etc.)
Language: TypeScript 2.5k 29 112265
coqui-ai/STT
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
Language: C++ 2.3k 62 183278
collabora/WhisperLive
A nearly-live implementation of OpenAI's Whisper.
Language: Python 2.2k 31 187299
react-native-voice/voice
🎤 React Native Voice Recognition library for iOS and Android (Online and Offline Support)
Language: Objective-C 1.9k 35 363501
jim-schwoebel/voice_datasets
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
1.8k 41 23229
coqui-ai/open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
1.3k 57 199141
ggeop/Python-ai-assistant
Python AI assistant 🧠
Language: Python 947 43 55247
MycroftAI/mycroft-precise
A lightweight, simple-to-use, RNN wake word listener
Language: Python 858 32 190230
yeyupiaoling/VoiceprintRecognition-Pytorch
This project uses a variety of advanced voiceprint recognition models, including EcapaTdnn, ResNetSE, ERes2Net, and CAM++, and more models may be supported in the future. It also supports the MelSpectrogram and Spectrogram data preprocessing methods.
Language: Python 849 10 69127
alexylem/jarvis
Jarvis.sh is a simple configurable multi-lang assistant.
Language: Shell 814 68 934197
EDDiscovery/EDDiscovery
Captain's log and 3D star map for Elite Dangerous
Language: C# 781 47 1.8k 174
Spr-Aachen/Easy-Voice-Toolkit
A locally deployable AI voice toolbox | A user-friendly audio toolkit for voice recognition, voice transcription, voice conversion, etc.
Language: Python 693 4 557
WSTxda/Plugin-VoiceGPT
Use ChatGPT voice chat as your device voice assistant
Language: Kotlin 693 21 3523
Picovoice/rhino
On-device Speech-to-Intent engine powered by deep learning
Language: Python 633 18 12286
evancohen/sonus
💬 /so.nus/ STT (speech to text) for Node with offline hotword detection
Language: JavaScript 631 32 7679
Picovoice/speech-to-text-benchmark
speech to text benchmark framework
Language: Python 622 30 1164
Picovoice/cheetah
On-device streaming speech-to-text engine powered by deep learning
Language: Python 601 34 8268
Picovoice/picovoice
On-device voice assistant platform powered by deep learning
Language: Python 594 20 278110
algolia/voice-overlay-ios
🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Language: Swift 547 52 3062
Viral-Doshi/Gesture-Controlled-Virtual-Mouse
Virtually controlling computer using hand-gestures and voice commands. Using MediaPipe, OpenCV Python.
Language: Python 526 13 38198
adrianhajdin/project_news_alan_ai
In this video, we're going to build a Conversational Voice Controlled React News Application using Alan AI. Alan AI is a revolutionary speech recognition software that allows you to add voice capabilities to your applications.
Language: JavaScript 504 7 9175
Cay-Zhang/SwiftSpeech
A speech recognition framework designed for SwiftUI.
Language: Swift 482 10 1659
hackingbeauty/react-mic
Record audio from a user's microphone and display a cool visualization.
Language: JavaScript 455 9 88158
hollance/TensorFlow-iOS-Example
Source code for my blog post "Getting started with TensorFlow on iOS"
Language: Swift 441 20 1086
Picovoice/leopard
On-device speech-to-text engine powered by deep learning
Language: Python 435 19 5127
rcbyron/hey-athena-client
Your personal voice assistant
Language: Python 422 53 5398
reriiasu/speech-to-text
Real-time transcription using faster-whisper
Language: HTML 421 8 2164
alphacep/vosk
VOSK Speech Recognition Toolkit
Language: C 388 29 748
jim-schwoebel/voicebook
🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).
Language: Python 377 25 2585
shamspias/customizable-gpt-chatbot
A dynamic, scalable AI chatbot built with Django REST framework, supporting custom training from PDFs, documents, websites, and YouTube videos. Leveraging OpenAI's GPT-3.5, Pinecone, FAISS, and Celery for seamless integration and performance.
Language: Python 374 12 1580
dictation-toolbox/Caster
Dragonfly-Based Voice Programming and Accessibility Toolkit
Language: Python 341 34 417120
Nikorasu/LiveWhisper
A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.
Language: Python 336 9 545
