YinPing-Cho
Pursuing a master's degree in Electrical Engineering. Fields of interest: digital signal processing and computer intelligence.
Dept. Electrical Engineering, National Tsinghua University, Hsinchu, Taiwan
YinPing-Cho's Stars
lucidrains/denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
cvxpy/cvxpy
A Python-embedded modeling language for convex optimization problems.
MoonInTheRiver/DiffSinger
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
MTG/essentia
C++ library for audio and music analysis, description and synthesis, including Python bindings
MiteshPuthran/Speech-Emotion-Analyzer
The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)
VainF/pytorch-msssim
Fast and differentiable MS-SSIM and SSIM for pytorch.
aliutkus/speechmetrics
A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR
slaypni/fastdtw
A Python implementation of FastDTW
NVlabs/denoising-diffusion-gan
Tackling the Generative Learning Trilemma with Denoising Diffusion GANs https://arxiv.org/abs/2112.07804
gabrielmittag/NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
OverLordGoldDragon/ssqueezepy
Synchrosqueezing, wavelet transforms, and time-frequency analysis in Python
k2kobayashi/sprocket
Voice Conversion Tool Kit
w86763777/pytorch-ddpm
Unofficial PyTorch implementation of Denoising Diffusion Probabilistic Models
r9y9/pysptk
A python wrapper for Speech Signal Processing Toolkit (SPTK).
MoonInTheRiver/NeuralSVB
Learning the Beauty in Songs: Neural Singing Voice Beautifier; ACL 2022 (Main conference); Official code
mpc001/Lipreading_using_Temporal_Convolutional_Networks
ICASSP'22 Training Strategies for Improved Lip-Reading; ICASSP'21 Towards Practical Lipreading with Distilled and Efficient Models; ICASSP'20 Lipreading using Temporal Convolutional Networks
lochenchou/MOSNet
Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"
mpariente/pystoi
Python implementation of the Short Term Objective Intelligibility measure
YatingMusic/ddsp-singing-vocoders
Official implementation of SawSing (ISMIR'22)
keonlee9420/DiffSinger
PyTorch implementation of DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (focused on DiffSpeech)
mpc001/end-to-end-lipreading
Pytorch code for End-to-End Audiovisual Speech Recognition
JasonSWFu/Quality-Net
Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model based on BLSTM. (Interspeech, 2018, with Travel Grants)
ebrevdo/synchrosqueezing
The MATLAB Synchrosqueezing Toolbox
Multi-Singer/Multi-Singer.github.io
RoyChao19477/PCS
Perceptual Contrast Stretching on Target Feature for Speech Enhancement (Accepted by INTERSPEECH 2022)
ttslr/python-MCD
hsinyilin19/Discriminator-Constrained-Optimal-Transport-Network
chomeyama/UnifiedSourceFilterGAN
YinPing-Cho/PCS-FIR-Filter
A time-domain extension to "Perceptual Contrast Stretching on Target Feature for Speech Enhancement"
YinPing-Cho/NTHU-ASP2021-Final
Final project repo for grad-level Adaptive Signal Processing course at National Tsinghua University, Taiwan.