Pinned Repositories
ona
An online virtual assistant in Catalan based on Mycroft. Visit https://ona.assistent.cat to try it out
catalan-speech-recognition-benchmark
A benchmark of speech recognition solutions for the Catalan language
deepspeech-catala
Deepspeech ASR Model for the Catalan Language
LocalSTT
Android Speech Recognition Service using Vosk/Kaldi and Mozilla DeepSpeech
OpenVoiceOS
OpenVoiceOS is a minimalistic linux OS bringing the open source voice assistant Mycroft A.I. to embbeded, low-spec headless and/or small (touch)screen devices.
rasa-nlu-microservice
A microservice with caching support for parsing text and training models with Rasa NLU
spacy-catala
Spacy NLP Model for the Catalan language
vosk-browser
A speech recognition library running in the browser thanks to a WebAssembly build of Vosk
wav2vec2-catala
Wav2Vec 2.0 catalan training scripts and models
wav2vec2-service
ccoreilly's Repositories
ccoreilly/vosk-browser
A speech recognition library running in the browser thanks to a WebAssembly build of Vosk
ccoreilly/LocalSTT
Android Speech Recognition Service using Vosk/Kaldi and Mozilla DeepSpeech
ccoreilly/wav2vec2-service
ccoreilly/espeak-ng
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
ccoreilly/NetflixEnCatala
Extensió pel Chrome que automàticament silencia l'àudio de Netflix i reprodueix el doblatge en català.
ccoreilly/robust-wav2vec2-sprint
ccoreilly/captioner
Generate subtitles of videos in the browser
ccoreilly/clapack-wasm
ccoreilly/commonvoice-utils
Linguistic processing for Common Voice
ccoreilly/conv_ssl
ccoreilly/coqui-ai-tensorflow
An Open Source Machine Learning Framework for Everyone
ccoreilly/datapipe
An audio ETL pipeline for generating datasets from youtube sources
ccoreilly/Federated-learning-ASR
ccoreilly/jocsdemots
ccoreilly/kaldi
ccoreilly/QuickVC-VoiceConversion
QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion
ccoreilly/raspberry-pi-pwm-fan-control
raspberry pi pwm fan control
ccoreilly/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
ccoreilly/speechbrain
A PyTorch-based Speech Toolkit
ccoreilly/spksrc
Cross compilation framework to create native packages for the Synology's NAS
ccoreilly/streaming-source-separation
Streaming source separation for music and speech files, using the Open-Unmix LSTM architecture.
ccoreilly/STT
The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
ccoreilly/telegram-deepspeech-bot
A Telegram bot that infers text from voice notes using DeepSpeech
ccoreilly/tevr-asr-tool
State-of-the-art (ranked #1 Aug 2022) German Speech Recognition in 284 lines of C++. This is a 100% private 100% offline 100% free CLI tool.
ccoreilly/text2lang
Language detection api based on ivanlau/language-detection-fine-tuned-on-xlm-roberta-base
ccoreilly/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
ccoreilly/VoiceActivityProjection
Voice Activity Projection Models: Self-supervised learning of Turn-taking Events
ccoreilly/vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
ccoreilly/vosk-server
WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
ccoreilly/vscode-audio-preview
VS Code Extension to preview and play wav file.