eziolotta's Stars
langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
mozilla/DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
spotify/annoy
Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
mozilla/TTS
:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
satellite-image-deep-learning/techniques
Techniques for deep learning with satellite & aerial imagery
alphacep/vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
TalAter/annyang
💬 Speech recognition for your site
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
marqo-ai/marqo
Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai
coqui-ai/STT
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
MontrealCorpusTools/Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
juanmc2005/diart
A python package to build AI-powered real-time audio applications
botfront/rasa-webchat
A feature-rich chat widget for Rasa and Botfront
rodrigopivi/Chatito
🎯🗯 Dataset generation for AI chatbots, NLP tasks, named entity recognition or text classification models using a simple DSL!
Stypox/dicio-android
Dicio assistant app for Android
markovka17/dla
Deep learning for audio processing
facebookresearch/voxpopuli
A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation
yaroslavvb/tensorflow-community-wheels
Place to upload links to TensorFlow wheels
ai-forever/mgpt
Multilingual Generative Pretrained Model
castorini/howl
Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice.
harvard-edge/multilingual_kws
Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus
zhenghuatan/rVAD
Matlab and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method.
zhenghuatan/rVADfast
This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method.
MozillaItalia/DeepSpeech-Italian-Model
Tooling for producing Italian model (public release available) for DeepSpeech and text corpus
castorini/honkling
Web app for keyword spotting using TensorflowJS
suzuki256/dog-dataset
alefiury/multilingual_kws_pytorch
Unofficial PyTorch implementation of Few-Shot Keyword Spotting in Any Language. A model for few-shot keyword spotting in any language, trained with the Multilingual Spoken Words Corpus.
dag7dev/another-one-the-game
game inspired by a popular Italian quiz tv show - entry for deepspeech-italian-contest
cnheider/rVADfast
This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method.
IlGalvo/N2W-IT