vsooda

@bytedance
Shanghai, China

Pinned Repositories

mxnet
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more
Language: C++ · 20.7k 1.1k 9.6k 6.8k
AByteOfCV
Implementations of computer vision algorithms using OpenCV
Language: C++ · 1 2 00
kaldi
This is now the official location of the Kaldi project.
Language: Shell · 2 2 00
marytts
MARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure Java
Language: Java · 1 2 00
mxnet-wgan
MXNet implementation of Conditional Wasserstein GAN
Language: Python · 20 3 25
mxnet_kaldi
Use MXNet and Kaldi to train an ASR model
Language: Python · 6 3 02
mxnet_merge_bn
Remove BatchNorm when deploying an MXNet model
Language: Python · 5 2 00
ncnn
ncnn is a high-performance neural network inference framework optimized for the mobile platform
Language: C++ · 13 6 04
vsooda.github.io
My blog
Language: JavaScript · 2 4 02

vsooda's Repositories

vsooda/ncnn
ncnn is a high-performance neural network inference framework optimized for the mobile platform
Language: C++ · 13 6 04
vsooda/mxnet_kaldi
Use MXNet and Kaldi to train an ASR model
Language: Python · 6 3 02
vsooda/mxnet_merge_bn
Remove BatchNorm when deploying an MXNet model
Language: Python · 5 2 00
vsooda/kaldi
This is now the official location of the Kaldi project.
Language: Shell · 2 2 00
vsooda/vsooda.github.io
My blog
Language: JavaScript · 2 4 02
vsooda/caffe-spn
Code for Learning Affinity via Spatial Propagation Networks
Language: Jupyter Notebook · 1 2 021
vsooda/MockingBird
🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real time
Language: JavaScript · 1 0 0
vsooda/Sinsy-Remix
The HMM-Based Singing Voice Synthesis System Remix "Sinsy-r"
Language: C++ · 1 2 0
vsooda/Audio-Effects
Collection of audio effects plugins implemented from the explanations in the book "Audio Effects: Theory, Implementation and Application" by Joshua D. Reiss and Andrew P. McPherson.
Language: C++ · 2 0
vsooda/chainer-VQ-VAE
A Chainer implementation of VQ-VAE.
Language: Python · 2 0
vsooda/ClariNet
A PyTorch implementation of ClariNet
Language: Python · 2 0
vsooda/crnn.pytorch
Convolutional recurrent network in PyTorch
Language: Python · 2 0
vsooda/deepvoice3_pytorch
PyTorch implementation of convolutional networks-based text-to-speech synthesis models
Language: Python · 2 0
vsooda/FloWaveNet
A PyTorch implementation of "FloWaveNet: A Generative Flow for Raw Audio"
Language: Python · 2 0
vsooda/kws
An End-to-End Architecture for Keyword Spotting and Voice Activity Detection
Language: Python · 2 0
vsooda/labelImg
LabelImg is a graphical image annotation tool for labeling object bounding boxes in images
Language: Python · 2 0
vsooda/LPCNet
Efficient neural speech synthesis
Language: C · 2 0
vsooda/MNN
MNN is a lightweight deep neural network inference engine.
Language: C++ · 1 0
vsooda/MobileNetV2.mxnet
An MXNet/Gluon implementation of MobileNetV2
Language: Python · 2 0
vsooda/mxnet
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more
Language: C++ · 2 0
vsooda/parallel_wavenet_vocoder
Parallel WaveNet Vocoder Based on ClariNet
Language: Python · 2 0
vsooda/Phonetisaurus
Phonetisaurus G2P
Language: Python · 2 0
vsooda/PRML
PRML algorithms implemented in Python
Language: Jupyter Notebook · 1 01
vsooda/pytorch-caffe-darknet-convert
Convert between PyTorch, Caffe prototxt/weights, and Darknet cfg/weights
Language: Python · 2 0
vsooda/resume
An elegant LaTeX résumé template
Language: TeX · 2 0
vsooda/see
Code for the AAAI 2018 publication "SEE: Towards Semi-Supervised End-to-End Scene Text Recognition"
Language: Python · 2 0
vsooda/VAD
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
Language: Matlab · 2 0
vsooda/vimrc
The ultimate Vim configuration: vimrc
Language: Vim script · 2 02
vsooda/YAD2K
YAD2K: Yet Another Darknet 2 Keras
Language: Python · 2 0
vsooda/yolo2-pytorch
YOLOv2 in PyTorch
Language: Python · 2 0