MingjieChen

Postdoc researcher, Speech Processing, Natural Language Processing, Conversational AI, at University of Sheffield

MingjieChen's Stars

suno-ai/bark
🔊 Text-Prompted Generative Audio Model
Language: Jupyter Notebook 36.2k 331 4444.3k
svc-develop-team/so-vits-svc
SoftVC VITS Singing Voice Conversion
Language: Python 25.9k 179 1304.8k
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Language: Python 12.5k 139 7171.3k
openai/consistency_models
Official repo for consistency models.
Language: Python 6.2k 59 52420
Plachtaa/VITS-fast-fine-tuning
This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
Language: Python 4.8k 41 566716
haoheliu/AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
Language: Python 2.5k 42 107222
lucidrains/audiolm-pytorch
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
Language: Python 2.4k 62 171265
archinetai/audio-ai-timeline
A timeline of the latest AI models for audio generation, starting in 2023!
1.9k 170 469
microsoft/i-Code
Language: Jupyter Notebook 1.7k 40 74161
lucidrains/naturalspeech2-pytorch
Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch
Language: Python 1.3k 53 31101
hollobit/GenAI_LLM_timeline
ChatGPT, GenerativeAI and LLMs Timeline
945 85 460
openvpi/audio-slicer
Python script that slices audio with silence detection
Language: Python 776 8 11270
wenet-e2e/wespeaker
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
Language: Python 729 16 128121
heatz123/naturalspeech
A fully working pytorch implementation of NaturalSpeech (Tan et al., 2022)
Language: Python 470 30 3468
liusongxiang/Large-Audio-Models
Keep track of big models in audio domain, including speech, singing, music etc.
457 46 228
maxrmorrison/torchcrepe
Pytorch implementation of the CREPE pitch tracker
Language: Python 408 8 2863
keonlee9420/DiffGAN-TTS
PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs
Language: Python 320 10 2744
microsoft/Pengi
An Audio Language model for Audio Tasks
Language: Python 290 14 1416
b04901014/MQTTS
Language: Python 254 12 1136
quickvc/QuickVC-VoiceConversion
QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion
Language: Python 226 21 2426
XinhaoMei/WavCaps
This repository contains metadata of the WavCaps dataset and code for downstream tasks.
Language: Python 205 5 2612
descriptinc/cargan
Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"
Language: Python 188 22 1429
roatienza/efficientspeech
PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.
Language: Jupyter Notebook 156 6 928
mechanicalsea/lighthubert
LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT
Language: Python 69 4 86
unilight/LDNet
Official implementation of the paper: "LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech"
Language: Python 61 4 310
dansuh17/jdcnet-pytorch
pytorch implementation of JDCNet, singing voice detection and classification network
Language: Python 49 1 65
ConferencingSpeech/ConferencingSpeech2022
Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge in Online Conferencing Applications
Language: Python 42 5 27
TUM-DAML/dbu-robustness
Evaluating Robustness of Predictive Uncertainty Estimation: Are Dirichlet-based Models Reliable? (ICML 2021)
Language: Python 27 2 00
jerinphilip/dirichlet-prior-networks
Language: Jupyter Notebook 8 3 22
pygongnlp/dialog_evaluation_paper_list
Dialog Evaluation Paper List: include multiple different dialog tasks
4 1 01
