lbehringer's Stars
junegunn/fzf
🌸 A command-line fuzzy finder
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
ajeetdsouza/zoxide
A smarter cd command. Supports all major shells.
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
py-pdf/pypdf
A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
snakers4/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
enhuiz/vall-e
An unofficial PyTorch implementation of the audio LM VALL-E
s3prl/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
jim-schwoebel/voice_datasets
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
coqui-ai/open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
YannickJadoul/Parselmouth
Praat in Python, the Pythonic way
NATSpeech/NATSpeech
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
synesthesiam/opentts
Open Text to Speech Server
lhotse-speech/lhotse
Tools for handling speech data in machine learning projects.
dmort27/epitran
A tool for transcribing orthographic text as IPA (International Phonetic Alphabet)
jitsi/jiwer
Evaluate your speech-to-text system with similarity measures such as word error rate (WER)
Kyubyong/css10
CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages
audiolabs/webMUSHRA
MUSHRA-compliant experiment software based on the Web Audio API
liusongxiang/ppg-vc
PPG-Based Voice Conversion
lingjzhu/CharsiuG2P
Multilingual G2P in 100 languages
yangdongchao/InstructTTS
The demo page of InstructTTS
janfreyberg/pytorch-revgrad
A minimal pytorch package implementing a gradient reversal layer.
xinjli/transphone
Phoneme tokenizer and grapheme-to-phoneme model for 8k languages
guanlongzhao/fac-via-ppg
Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)
DigitalPhonetics/speaker-anonymization
Speaker anonymization pipeline for hiding the identity of the speaker of a recording by changing the voice in it.
Selimonder/birdclef-2022
carlfm01/librivox-tools
Collector and speech cutter for librivox audiobooks
Bartelds/acoustic-distance-measure
Acoustic distance measure for comparing pronunciations
cldf-clts/clts
Cross-Linguistic Transcription Systems
Flux9665/TTSCorpusCreator
A tool that makes creating text-to-speech corpora easier.