michael-kuhlmann
PhD Student at Paderborn University voice conversion, speech synthesis, voice profiling
Paderborn UniversityPaderborn
michael-kuhlmann's Stars
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
hechmik/voxceleb_enrichment_age_gender
Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021
ankitshah009/awesome-terminal-hacks
A repository consisting of useful terminal commands required in daily tasks to reduce stackoverflow searches.
webdataset/webdataset
A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
HobbitLong/SupContrast
PyTorch implementation of "Supervised Contrastive Learning" (and SimCLR incidentally)
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
jitsi/jiwer
Evaluate your speech-to-text system with similarity measures such as word error rate (WER)
isca-mentoring/mentoring_database
DmitryRyumin/INTERSPEECH-2023-24-Papers
INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
jbhuang0604/awesome-tips
RF5/simple-asgan
Training code and trained checkpoints for ASGAN.
winddori2002/TriAAN-VC
TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion
YannDubs/disentangling-vae
Experiments for understanding disentanglement in VAE latent representations
takluyver/nbopen
Open a Jupyter notebook in the best available server
probabilists/zuko
Normalizing flows in PyTorch
MLSpeech/FormantsTracker
pretix/pretix
Ticket shop application for conferences, festivals, concerts, tech events, shows, exhibitions, workshops, barcamps, etc.
google-research/disentanglement_lib
disentanglement_lib is an open-source library for research on learning disentangled representations.
audeering/opensmile-python
Python package for openSMILE
huggingface/diffusion-models-class
Materials for the Hugging Face Diffusion Models Course
Orange-OpenSource/diSpeech
Materials to generate diSpeech datasets, composed of phonemes generated with Klatt synthetizer, for speech disentanglement purposes.
dair-ai/ML-YouTube-Courses
📺 Discover the latest machine learning / AI courses on YouTube.
tuanvu92/Intelligible_VC
gustavo-beck/wavebender-gan
shubhamgrg04/awesome-diagramming
A curated collection of diagramming tools used by leading software engineering teams
muqiaoy/eGeMAPS_estimator
HidekiKawahara/CAPRICEP
An extended TSP (Time Stretched Pulse). CAPRICEP substantially replaces FVN. CAPRICEP enables interactive and real-time measurement of the linear time-invariant, the non-linear time-invariant, and random and time varying responses simultaneously.
XiaoyuBIE1994/DVAE
Official implementation of Dynamical VAEs