lunar333

lunar333's Stars

asteroid-team/asteroid
The PyTorch-based audio source separation toolkit for researchers
Language: Python · 2.2k stars · 422 forks
resemble-ai/Resemblyzer
A Python package to analyze and compare voices with deep learning
Language: Python · 2.7k stars · 422 forks
dtlnor/stable-diffusion-webui-localization-zh_CN
Simplified Chinese translation extension for AUTOMATIC1111's stable diffusion webui
1.5k stars · 157 forks
bshall/hubert
HuBERT content encoders for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion
Language:Python32353
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Language: Python · 30.2k stars · 6.4k forks
THUDM/ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Language: Python · 40.4k stars · 5.2k forks
resautu/chat-with-Elysia
Language:Python3710
serp-ai/bark-with-voice-clone
🔊 Text-prompted Generative Audio Model - With the ability to clone voices
Language: Jupyter Notebook · 3.1k stars · 411 forks
vocaliodmiku/wav2vec2mdd
End-to-End Mispronunciation Detection via wav2vec2.0
Language:Python406
b04901014/FT-w2v2-ser
Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition
Language:Python13632
Renovamen/Speech-Emotion-Recognition
Speech emotion recognition implemented in Keras (LSTM, CNN, SVM, MLP)
Language:Python974218
DemisEom/SpecAugment
An implementation of SpecAugment with TensorFlow & PyTorch, introduced by Google Brain
Language:Python639135
cageyoko/CTC-Attention-Mispronunciation
A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques
Language:Python5621
xmu-xiaoma666/External-Attention-pytorch
🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐
Language: Python · 11.3k stars · 1.9k forks
b04901014/FG-transformer-TTS
Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.
Language:Python8611
CjangCjengh/vits
VITS implementation of Japanese, Chinese, Korean, Sanskrit and Thai
Language:Python908195
innnky/emotional-vits
An emotion-controllable speech synthesis model that requires no emotion labels, based on VITS
Language: Jupyter Notebook · 1.3k stars · 167 forks
KinglittleQ/GST-Tacotron
A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
Language:Python35772
Plachtaa/VITS-fast-fine-tuning
This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
Language: Python · 4.7k stars · 704 forks
prophesier/diff-svc
Singing Voice Conversion via diffusion model
Language: Jupyter Notebook · 2.6k stars · 802 forks
TParcollet/E2E-SincNet
E2E-SincNet: Toward fully end-to-end speech recognition
Language:Shell294
CjangCjengh/MoeGoe
Executable file for VITS inference
Language: Python · 2.3k stars · 249 forks