windforestfiremountain

Hohai UniversityJiangSu Nanjing

windforestfiremountain's Stars

2noise/ChatTTS
A generative speech model for daily dialogue.
Language:Python32.9k 188 5753.6k
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
Language:Python9k 134 1.1k1.4k
FunAudioLLM/CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Language:Python6.8k 63 567725
gpt-omni/mini-omni
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
Language:Python3.2k 98 118286
r9y9/wavenet_vocoder
WaveNet vocoder
Language:Python2.3k 96 193502
LCAV/pyroomacoustics
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
Language:Python1.5k 40 239433
BytedanceSpeech/seed-tts-eval
Language:Python1.1k 13 15106
yeyupiaoling/VoiceprintRecognition-Pytorch
This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, CAM++, etc. It is not excluded that more models will be supported in the future. At the same time, this project also supports MelSpectrogram, Spectrogram data preprocessing methods
Language:Python842 10 69126
facebookresearch/WavAugment
A library for speech data augmentation in time-domain
Language:Python653 25 1758
OlaWod/FreeVC
FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion
Language:Python606 19 88111
HarryVolek/PyTorch_Speaker_Verification
PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.
Language:Python578 19 74164
schmiph2/pysepm
Python implementation of performance metrics in Loizou's Speech Enhancement book
Language:Python394 9 1287
LSimon95/megatts2
Unoffical implementation of Megatts2
Language:Python270 22 2035
tarepan/SpeechMOS
Easy-to-Use Speech MOS predictors
Language:Python235 7 1516
yistLin/FragmentVC
Any-to-any voice conversion by end-to-end extracting and fusing fine-grained voice fragments with attention
Language:Python202 13 2738
sarulab-speech/UTMOSv2
UTokyo-SaruLab MOS Prediction System
Language:Python105 5 58
unilight/s3prl-vc
S3PRL-VC: A Voice Conversion Toolkit based on S3PRL
Language:Python97 3 412
audiolabs/rir-generator
Language:Python77 5 115
yistLin/universal-vocoder
A PyTorch implementation of the universal neural vocoder
Language:Python67 4 39
XiangLi2022/CM-TTS
[Findings of NAACL 2024] Source code of paper CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers and Consistency Models
Language:Python62 4 33
YuanGongND/python-compute-eer
Simple Python script to compute equal error rate (EER) for machine learning model evaluation.
Language:Python39 1 35
OlaWod/PitchVC
PitchVC: Pitch Conditioned Any-to-Many Voice Conversion
Language:Python34 5 44
theshi-1128/llm-defense
An easy-to-use Python framework to defend against jailbreak prompts.
Language:Python160
SandyPanda-MLDL/-Evaluation-Metrics-Used-For-The-Performance-Evaluation-of-Voice-Conversion-VC-Models
Evaluation Metrics Used For The Performance Evaluation of Voice Conversion (VC) Models
Language:Jupyter Notebook12 1 12
Edresson/Speech2Phone
Speech2Phone: A Multilingual and Text Independent Speaker Identification Model
Language:Jupyter Notebook9 3 01
daved01/Adversarial_Examples
Review and analysis of selected adversarial attacks. We implement common attack methods and evaluate them with a GoogleNet network on ImageNet like data.
Language:Jupyter Notebook3 1 00
gino0950150/RW_VoiceShield
Language:Python1 1 0
MaxMax2016/DeepSpeaker_RawNet_GE2E
分别在VCTK、AISHELL1 和 VoxCeleb1 三个标准公开数据集上对三种端到端声纹模型框架（Deep Speaker, RawNet, GE2E）进行实验比较。
Language:Python1 0 0
VoicePrivacy/Adeversarial-Speech-with-YourTTS
Language:HTML1 1 0
ztMotaLee/previous_homepage
Baiang Li's homepage.
Language:HTML10

windforestfiremountain

windforestfiremountain's Stars

2noise/ChatTTS

speechbrain/speechbrain

FunAudioLLM/CosyVoice

gpt-omni/mini-omni

r9y9/wavenet_vocoder

LCAV/pyroomacoustics

BytedanceSpeech/seed-tts-eval

yeyupiaoling/VoiceprintRecognition-Pytorch

facebookresearch/WavAugment

OlaWod/FreeVC

HarryVolek/PyTorch_Speaker_Verification

schmiph2/pysepm

LSimon95/megatts2

tarepan/SpeechMOS

yistLin/FragmentVC

sarulab-speech/UTMOSv2

unilight/s3prl-vc

audiolabs/rir-generator

yistLin/universal-vocoder

XiangLi2022/CM-TTS

YuanGongND/python-compute-eer

OlaWod/PitchVC

theshi-1128/llm-defense

SandyPanda-MLDL/-Evaluation-Metrics-Used-For-The-Performance-Evaluation-of-Voice-Conversion-VC-Models

Edresson/Speech2Phone

daved01/Adversarial_Examples

gino0950150/RW_VoiceShield

MaxMax2016/DeepSpeaker_RawNet_GE2E

VoicePrivacy/Adeversarial-Speech-with-YourTTS

ztMotaLee/previous_homepage