MisakaMikoto96's Stars
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
fuergaosi233/wechat-chatgpt
Use ChatGPT On Wechat via wechaty
diff-usion/Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models
huggingface/trl
Train transformer language models with reinforcement learning.
modelscope/modelscope
ModelScope: bring the notion of Model-as-a-Service to life.
MoonInTheRiver/DiffSinger
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
facebookresearch/encodec
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
eugeneyan/ml-surveys
📋 Survey papers summarizing advances in deep learning, NLP, CV, graphs, reinforcement learning, recommendations, graphs, etc.
s3prl/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
HarderThenHarder/transformers_tasks
⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SFT etc.
zoubohao/DenoisingDiffusionProbabilityModel-ddpm-
This may be the simplest implement of DDPM. You can directly run Main.py to train the UNet on CIFAR-10 dataset and see the amazing process of denoising.
NiuTrans/CNSurvey
一份中文综述文章列表(自然语言处理&机器学习)
wenet-e2e/WeTextProcessing
Text Normalization & Inverse Text Normalization
Rongjiehuang/FastDiff
PyTorch Implementation of FastDiff (IJCAI'22)
exodrifter/unity-python
Python plugin for Unity3D.
bshall/hubert
HuBERT content encoders for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion
shanglianlm0525/CvPytorch
CvPytorch is an open source COMPUTER VISION toolbox based on PyTorch.
keonlee9420/Cross-Speaker-Emotion-Transfer
PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech
tts-tutorial/interspeech2022
Zain-Jiang/Dict-TTS
speechandlanguageprocessing/ICASSP2022-Depression
Automatic Depression Detection: a GRU/ BiLSTM-based Model and An Emotional Audio-Textual Corpus
Daisyqk/Automatic-Prosody-Annotation
atosystem/SpeechCLIP
SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022
KunZhou9646/Mixed_Emotions
bshall/acoustic-model
Acoustic models for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion
Spijkervet/contrastive-predictive-coding
PyTorch implementation of Representation Learning with Contrastive Predictive Coding by Van den Oord et al. (2018)
JSALT-2022-SSL/superb-prosody
funderburkjim/pynini-learn
Learning the Levenshtein Automaton of Pynini library
BinhMinhs10/FST_transducer_grammer
pynini operation