MingjieChen

Postdoc researcher, Speech Processing, Natural Language Processing, Conversational AI, at University of Sheffield

MingjieChen's Stars

karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Language:Python36.5k 371 3155.7k
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
12k 270 109769
OptimalScale/LMFlow
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
Language:Python8.2k 72 407822
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Language:Jupyter Notebook6k 71 989758
guillaumekln/faster-whisper
Faster Whisper transcription with CTranslate2
Language:Python5.7k 100 382426
wenet-e2e/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
Language:Python4.1k 90 1k1.1k
microsoft/i-Code
Language:Jupyter Notebook1.7k 40 74161
wq2012/awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
1.6k 78 8225
0nutation/SpeechGPT
SpeechGPT Series: Speech Large Language Models
Language:Python1.2k 45 4383
atong01/conditional-flow-matching
TorchCFM: a Conditional Flow Matching library
Language:Python1.1k 15 5085
hollobit/GenAI_LLM_timeline
ChatGPT, GenerativeAI and LLMs Timeline
941 84 460
DmitryRyumin/INTERSPEECH-2023-Papers
INTERSPEECH 2023 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!
616 87 442
yangdongchao/AcademiCodec
AcademiCodec: An Open Source Audio Codec Model for Academic Research
Language:Python572 31 4079
ivanvovk/WaveGrad
Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch.
Language:Jupyter Notebook402 17 2655
audiolabs/webMUSHRA
a MUSHRA compliant web audio API based experiment software
Language:JavaScript346 18 82135
lablab-ai/Whisper-transcription_and_diarization-speaker-identification-
How to use OpenAIs Whisper to transcribe and diarize audio files
Language:Jupyter Notebook286 6 841
microsoft/Pengi
An Audio Language model for Audio Tasks
Language:Python282 14 1315
adelacvg/NS2VC
Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech
Language:Python228 19 3712
DongKeon/Awesome-Speaker-Diarization
Some comprehensive papers about speaker diarization
198 11 03
roatienza/efficientspeech
PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.
Language:Jupyter Notebook150 6 926
jasonppy/PromptingWhisper
Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation
Language:Python132 4 811
ga642381/Speech-Prompts-Adapters
This Repository surveys the paper focusing on Prompting and Adapters for Speech Processing.
103 11 25
pyf98/DPHuBERT
INTERSPEECH 2023: "DPHuBERT: Joint Distillation and Pruning of Self-Supervised Speech Models"
Language:Python101 6 59
patrickltobing/cyclevae-vc-neuralvoco
Language:Python90 6 519
facebookresearch/Noresqa
This github repo is for Neurips 2021 and Interspeech 2022 papers on Non-Matching Reference based estimation of speech quality assessment.
Language:Python84 7 1012
linan2/Voice-activity-detection-VAD-paper-and-code
Voice activity detection (VAD) paper and code（From 198*~ ）and its classification.
84 6 012
caskcsg/SPCL
code for "Supervised Prototypical Contrastive Learning for Emotion Recognition in Conversation, EMNLP 22"
Language:Python73 2 68
guxm2021/ALT_SpeechBrain
[ISMIR 2022] Transfer Learning of wav2vec 2.0 for Automatic Lyric Transcription
Language:Python44 1 46
W-Wu/DEER
Language:Python91
ZhihaoDU/du2022sond
Speaker overlap-aware Neural Diarization
90

MingjieChen

MingjieChen's Stars

karpathy/nanoGPT

BradyFU/Awesome-Multimodal-Large-Language-Models

OptimalScale/LMFlow

pyannote/pyannote-audio

guillaumekln/faster-whisper

wenet-e2e/wenet

microsoft/i-Code

wq2012/awesome-diarization

0nutation/SpeechGPT

atong01/conditional-flow-matching

hollobit/GenAI_LLM_timeline

DmitryRyumin/INTERSPEECH-2023-Papers

yangdongchao/AcademiCodec

ivanvovk/WaveGrad

audiolabs/webMUSHRA

lablab-ai/Whisper-transcription_and_diarization-speaker-identification-

microsoft/Pengi

adelacvg/NS2VC

DongKeon/Awesome-Speaker-Diarization

roatienza/efficientspeech

jasonppy/PromptingWhisper

ga642381/Speech-Prompts-Adapters

pyf98/DPHuBERT

patrickltobing/cyclevae-vc-neuralvoco

facebookresearch/Noresqa

linan2/Voice-activity-detection-VAD-paper-and-code

caskcsg/SPCL

guxm2021/ALT_SpeechBrain

W-Wu/DEER

ZhihaoDU/du2022sond