YuXU-Jouuuuuu's Stars
jiaaro/pydub
Manipulate audio with a simple and easy high level interface
AIGC-Audio/AudioGPT
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
RVC-Project/Retrieval-based-Voice-Conversion-WebUI
Easily train a good VC model with voice data <= 10 mins!
AndreevP/wvmos
MOS score prediction by fine-tuned wav2vec2.0 model
google/visqol
Perceptual Quality Estimator for speech and audio
nanahou/Awesome-Bandwidth-Extension
This is a curated list of awesome Speech Bandwidth Extension tutorials, papers, libraries, datasets, tools, scripts and results. The purpose of this repo is to organize the world’s resources for speech bandwidth extension, and make them universally accessible and useful.
eloimoliner/CQTdiff
Official repository of the paper "Solving Audio Inverse Problems with a Diffusion Model", submitted to ICASSP 23
jtkim-kaist/VAD
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
drowe67/codec2
Open source speech codec designed for communications quality speech between 700 and 3200 bit/s. The main application is low bandwidth HF/VHF digital radio.
guozixunnicolas/DENT_DDSP
TowerYsable/ASR_awesome
语音识别 论文 前沿
FChin39/Noise_Generator
haoheliu/voicefixer
General Speech Restoration
Wataru-Nakata/FastSpeech2-JSUT
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
jik876/hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
ruizhecao96/CMGAN
Conformer-based Metric GAN for speech enhancement
gabrielmittag/NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
PKUFlyingPig/cs-self-learning
计算机自学指南
gitmylo/bark-voice-cloning-HuBERT-quantizer
The code for the bark-voicecloning model. Training and inference.
NXTProduct/TUNet
lochenchou/MOSNet
Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"
pollen-robotics/dtw
DTW (Dynamic Time Warping) python module
PKUFlyingPig/Self-learning-Computer-Science
the resources I use to learn computer science in my spare time
xefonon/BandwidthExtensionRIRs
Deep generative models for extending the bandwidth of reconstructed room impulse responses corrupted by undersampling.
rishikksh20/HiFiplusplus-pytorch
HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement
aiyang8067/Hierarchical-Recurrent-Neural-Networks-for-Speech-Bandwidth-Extension
Codes of the paper: * Zhen-Hua Ling , Yang Ai, Yu Gu, and Li-Rong Dai, "Waveform Modeling and Generation Using Hierarchical Recurrent Neural Networks for Speech Bandwidth Extension," IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 26, no. 5, pp. 883-894, 2018.
CARNIVAL-IITP/Bandwidth-extension
bachhavpramod/bandwidth_extension
yoyololicon/diffwave-sr
d2l-ai/d2l-en
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.