hyzhan

Guangzhou

Pinned Repositories

auraloss
Collection of audio-focused loss functions in PyTorch
Language:Python
caffe
Caffe: a fast open framework for deep learning.
Language:C++
caffe-fast-rcnn
Caffe fork that supports Fast R-CNN
Language:C++
Chinese_conversation_sentiment
A Chinese sentiment dataset that may be useful for sentiment analysis.
ClariNet
A PyTorch implementation of ClariNet
Language:Python
cnn_graph
Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering
Language:Jupyter Notebook
code01
Language:Python
CosyVoice
Multi-lingual large voice generation model, providing full-stack inference, training, and deployment capabilities.
Language:Python
deep-voice-conversion
Deep neural networks for voice conversion (voice style transfer) in TensorFlow
Language:Python
denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model that works on the raw waveform and runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise, including stationary and non-stationary noise, as well as room reverb. Additionally, we suggest a set of data augmentation techniques, applied directly to the raw waveform, which further improve model performance and generalization.
Language:Python

hyzhan's Repositories

hyzhan/Chinese_conversation_sentiment
A Chinese sentiment dataset that may be useful for sentiment analysis.
hyzhan/NeoPI
Language:Python
hyzhan/node-openjtalk
Japanese text-to-speech engine binding for NodeJS
Language:C++
hyzhan/test
Language:C++
