Pinned Repositories
acoustic-model
Acoustic models for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion
balena-pi4
CoMoSpeech
one-step diffusion based speech synthesis
eschmidbauer.github.io
FlexFlow
A distributed deep learning framework.
flutter_sherpa_onnx
Flutter plugin wrapping the Sherpa-ONNX runtime
freeswitch
FreeSWITCH is a Software Defined Telecom Stack enabling the digital transformation from proprietary telecom switches to a versatile software implementation that runs on any commodity hardware. From a Raspberry PI to a multi-core server, FreeSWITCH can unlock the telecommunications potential of any device.
goesl
Freeswitch Event Socket Library wrapper for Go
voicefixer
General Speech Restoration
websocket-audio-stream
pyaudio & websocket to stream real-time audio to speakers
eschmidbauer's Repositories
eschmidbauer/websocket-audio-stream
pyaudio & websocket to stream real-time audio to speakers
eschmidbauer/voicefixer
General Speech Restoration
eschmidbauer/acoustic-model
Acoustic models for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion
eschmidbauer/balena-pi4
eschmidbauer/CoMoSpeech
one-step diffusion based speech synthesis
eschmidbauer/eschmidbauer.github.io
eschmidbauer/FlexFlow
A distributed deep learning framework.
eschmidbauer/flutter_sherpa_onnx
Flutter plugin wrapping the Sherpa-ONNX runtime
eschmidbauer/freeswitch
FreeSWITCH is a Software Defined Telecom Stack enabling the digital transformation from proprietary telecom switches to a versatile software implementation that runs on any commodity hardware. From a Raspberry PI to a multi-core server, FreeSWITCH can unlock the telecommunications potential of any device.
eschmidbauer/goesl
Freeswitch Event Socket Library wrapper for Go
eschmidbauer/greenswitch
Battle proven FreeSWITCH Event Socket Protocol client implementation with Gevent
eschmidbauer/kamailio
Kamailio - The Open Source SIP Server
eschmidbauer/metaseq
Repo for external large-scale work
eschmidbauer/peerless
Peerless Animate API
eschmidbauer/rtpbreak
Saved for posterity.
eschmidbauer/Speech-Backbones
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
eschmidbauer/MeloTTS
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
eschmidbauer/mod_audio_stream
FreeSWITCH module to stream audio to websocket and receive response
eschmidbauer/mod_vad
a voice activity detection module for freeswitch.
eschmidbauer/mod_whisper_asr
Freeswitch ASR module to working with whisper_cpp
eschmidbauer/NeMo-text-processing
NeMo text processing for ASR and TTS
eschmidbauer/pkg-kamailio-docker
Docker files to easily build Kamailio on different Debian/Ubuntu releases
eschmidbauer/RAD-MMM
A TTS model that makes a speaker speak new languages
eschmidbauer/RVC_CLI
RVC CLI enables seamless interaction with Retrieval-based Voice Conversion through commands or HTTP requests.
eschmidbauer/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
eschmidbauer/whisper-cpp-server
whisper-cpp-server
eschmidbauer/whisperd
Unified API for various whisper implementations
eschmidbauer/X-E-Speech-code
X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion