Pinned Repositories
mxnet
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more
AByteOfCV
implement of computer vision algorithm using opencv
kaldi
This is now the official location of the Kaldi project.
marytts
MARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure java
mxnet-wgan
mxnet implement for Conditional Wasserstein GAN
mxnet_kaldi
use mxnet and kaldi to train asr model
mxnet_merge_bn
remove batchnorm when deploy mxnet model
ncnn
ncnn is a high-performance neural network inference framework optimized for the mobile platform
vsooda.github.io
my blog
vsooda's Repositories
vsooda/ncnn
ncnn is a high-performance neural network inference framework optimized for the mobile platform
vsooda/mxnet_kaldi
use mxnet and kaldi to train asr model
vsooda/mxnet_merge_bn
remove batchnorm when deploy mxnet model
vsooda/kaldi
This is now the official location of the Kaldi project.
vsooda/vsooda.github.io
my blog
vsooda/caffe-spn
Codes for Learning Affinity via Spatial Propagation Networks
vsooda/MockingBird
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
vsooda/Sinsy-Remix
The HMM-Based Singing Voice Syntheis System Remix "Sinsy-r"
vsooda/Audio-Effects
Collection of audio effects plugins implemented from the explanations in the book "Audio Effects: Theory, Implementation and Application" by Joshua D. Reiss and Andrew P. McPherson.
vsooda/chainer-VQ-VAE
A Chainer implementation of VQ-VAE.
vsooda/ClariNet
A Pytorch Implementation of ClariNet
vsooda/crnn.pytorch
Convolutional recurrent network in pytorch
vsooda/deepvoice3_pytorch
PyTorch implementation of convolutional networks-based text-to-speech synthesis models
vsooda/FloWaveNet
A Pytorch implementation of "FloWaveNet: A Generative Flow for Raw Audio"
vsooda/kws
An End-to-End Architecture for Keyword Spotting and Voice Activity Detection
vsooda/labelImg
:metal: LabelImg is a graphical image annotation tool and label object bounding boxes in images
vsooda/LPCNet
Efficient neural speech synthesis
vsooda/MNN
MNN is a lightweight deep neural network inference engine.
vsooda/MobileNetV2.mxnet
A MXNet/Gluon implementation of MobileNetV2
vsooda/mxnet
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more
vsooda/parallel_wavenet_vocoder
Parallel WaveNet Vocoder Based on ClariNet
vsooda/Phonetisaurus
Phonetisaurus G2P
vsooda/PRML
PRML algorithms implemented in Python
vsooda/pytorch-caffe-darknet-convert
convert between pytorch, caffe prototxt/weights and darknet cfg/weights
vsooda/resume
An elegant \LaTeX\ résumé template
vsooda/see
Code for the AAAI 2018 publication "SEE: Towards Semi-Supervised End-to-End Scene Text Recognition"
vsooda/VAD
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
vsooda/vimrc
The ultimate Vim configuration: vimrc
vsooda/YAD2K
YAD2K: Yet Another Darknet 2 Keras
vsooda/yolo2-pytorch
YOLOv2 in PyTorch