MingjieChen
Postdoc researcher, Speech Processing, Natural Language Processing, Conversational AI, at University of Sheffield
MingjieChen's Stars
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
svc-develop-team/so-vits-svc
SoftVC VITS Singing Voice Conversion
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
openai/consistency_models
Official repo for consistency models.
Plachtaa/VITS-fast-fine-tuning
This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
haoheliu/AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
lucidrains/audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
archinetai/audio-ai-timeline
A timeline of the latest AI models for audio generation, starting in 2023!
microsoft/i-Code
lucidrains/naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
hollobit/GenAI_LLM_timeline
ChatGPT, GenerativeAI and LLMs Timeline
openvpi/audio-slicer
Python script that slices audio with silence detection
wenet-e2e/wespeaker
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
heatz123/naturalspeech
A fully working pytorch implementation of NaturalSpeech (Tan et al., 2022)
liusongxiang/Large-Audio-Models
Keep track of big models in audio domain, including speech, singing, music etc.
maxrmorrison/torchcrepe
Pytorch implementation of the CREPE pitch tracker
keonlee9420/DiffGAN-TTS
PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs
microsoft/Pengi
An Audio Language model for Audio Tasks
b04901014/MQTTS
quickvc/QuickVC-VoiceConversion
QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion
XinhaoMei/WavCaps
This repository contains metadata of the WavCaps dataset and code for downstream tasks.
descriptinc/cargan
Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"
roatienza/efficientspeech
PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.
mechanicalsea/lighthubert
LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT
unilight/LDNet
Official implementation of the paper: "LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech"
dansuh17/jdcnet-pytorch
pytorch implementation of JDCNet, singing voice detection and classification network
ConferencingSpeech/ConferencingSpeech2022
Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge in Online Conferencing Applications
TUM-DAML/dbu-robustness
Evaluating Robustness of Predictive Uncertainty Estimation: Are Dirichlet-based Models Reliable? (ICML 2021)
jerinphilip/dirichlet-prior-networks
pygongnlp/dialog_evaluation_paper_list
Dialog Evaluation Paper List: includes multiple different dialog tasks