Pinned Repositories
1D-Condition-method-pytorch
Conditioning and feature fusion methods such as FiLM, Conditional Layer Norm and AdaIN.
AcademiCodec
AcademiCodec: An Open Source Audio Codec Model for Academic Research
asvspoof2019_wav2vec2
bigban
Chinese-LLaMA-Alpaca
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
CycleDiff
DecryptPrompt
总结Prompt&LLM论文,开源数据&模型,AIGC应用
EAD-VC
EADVC
EADVC.github.io
rechawine's Repositories
rechawine/asvspoof2019_wav2vec2
rechawine/1D-Condition-method-pytorch
Conditioning and feature fusion methods such as FiLM, Conditional Layer Norm and AdaIN.
rechawine/AcademiCodec
AcademiCodec: An Open Source Audio Codec Model for Academic Research
rechawine/Chinese-LLaMA-Alpaca
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
rechawine/CycleDiff
rechawine/DecryptPrompt
总结Prompt&LLM论文,开源数据&模型,AIGC应用
rechawine/EAD-VC
rechawine/EADVC
rechawine/EADVC.github.io
rechawine/FullConv-TTS
rechawine/huawei
rechawine/g2p-mix
Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English
rechawine/ipa-tokenizer
IPA transcription tokenizer
rechawine/MeloTTS
中英混TTS: High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
rechawine/MUSE-Speech-Enhancement
Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enhancemen
rechawine/NATSpeech
A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
rechawine/nlp_project
rechawine/objective_fake_audio_detect
rechawine/Paper-Writing-Tips
MLNLP社区用来帮助大家避免论文投稿小错误的整理仓库。 Paper Writing Tips
rechawine/phonemizer
Simple text to phones converter for multiple languages
rechawine/PitchExtractor
Deep Neural Pitch Extractor for Voice Conversion and TTS Training
rechawine/SECap
探索语音情感的文本描述,解决现有模型探索能力不足的问题
rechawine/specAugment_tool
Data Augmentation Methods for Speech
rechawine/speech_lm_score
无监督的音频生成模型打分
rechawine/SRD-VC
Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)
rechawine/StableTTS
Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3
rechawine/Text2PhonemeSequence
VITS专用文本转音素, 文本转国际音标
rechawine/vall-e
rechawine/vc-lm
使用encodec, 将音频离散化成tokens, 该项目包含两阶段模型 AR模型和NAR模型。
rechawine/whisper_ppg
whisper ppg for voice change