smallflyingpig

B.S. from Huazhong University of Science and Technology (HUST). Ph.D. from the University of Chinese Academy of Sciences (UCAS).

Pinned Repositories

Generative_Adversarial_Networks_PyTorch
Implementations of GAN, Improved GAN, DCGAN, LAPGAN, and InfoGAN in PyTorch
Language:Python2 2 01
learning-to-fool-the-speaker-recognition
code for paper "learning to fool the speaker recognition"
Language:Python10 3 37
projects
Some course projects
Language:Python28 4 055
pytorch_examples
Some PyTorch examples
Language:Python1 2 02
pytorch_video_caption
Some video captioning models (S2VT) implemented in PyTorch.
Language:Python23 3 24
SoundNet_Pytorch
Converts the pretrained TensorFlow SoundNet model to PyTorch
Language:Python13 2 43
speech-to-image-translation-without-text
Code for paper "direct speech-to-image translation"
Language:Python27 4 06
Surround360
Surround360 is Facebook's open source hardware and software for capturing stereoscopic 3D 360 video for VR. The repo contains hardware designs, as well as software for camera control and rendering.
Language:C++1 2 00
universal_adversarial_perturbation_generative_network_for_speaker_recognition
code for paper "Universal Adversarial Perturbations Generative Network for Speaker Recognition"
Language:Python22 3 46
xavs2
xavs2 is an open-source encoder of Chinese AVS2 video coding standard.
Language:C1 2 00

smallflyingpig's Repositories

smallflyingpig/lstm_stock_pred_pytorch
Stock prediction with an LSTM in PyTorch
Language:Python7 2 03
smallflyingpig/python_audio
python code for audio processing
Language:Jupyter Notebook1
smallflyingpig/3D-convolutional-speaker-recognition-pytorch
Deep Learning & 3D Convolutional Neural Networks for Speaker Verification
smallflyingpig/AMC-GAN
Reimplementation of AMC-GAN: https://sites.google.com/vision.snu.ac.kr/icml2018-video-prediction
smallflyingpig/avspeech-downloader
AVSpeech downloader
Language:Shell
smallflyingpig/crnn-pytorch
Pytorch implementation of OCR system using CRNN + CTCLoss
Language:Python
smallflyingpig/datamining
Data mining learning exercises
Language:Python
smallflyingpig/DAVEnet-pytorch
Deep Audio-Visual Embedding network (DAVEnet) implementation in PyTorch
Language:Python
smallflyingpig/docker_examples
some examples for docker/nvidia_docker
Language:Python
smallflyingpig/draw_pytorch
DRAW: A Recurrent Neural Network For Image Generation
Language:Python
smallflyingpig/faster-rcnn.pytorch
A faster pytorch implementation of faster r-cnn
Language:Python2 0
smallflyingpig/improved_gan_training
Language:Python
smallflyingpig/l3embedding
Learn an L3 embedding from audio/video pairs
Language:Jupyter Notebook
smallflyingpig/MNIST-baselines
Baseline classifiers on the polluted MNIST dataset, SJTU CS420 course project
Language:Python
smallflyingpig/mocogan
MoCoGAN: Decomposing Motion and Content for Video Generation
Language:Python
smallflyingpig/PWC-Net
PWC-Net: CNNs for Optical Flow Using Pyramid, Warping, and Cost Volume, CVPR 2018 (Oral)
Language:Cuda
smallflyingpig/pycochleagram
Generate cochleagrams natively in Python. Ported from Josh McDermott's MATLAB code.
Language:Python
smallflyingpig/Python-Bing-TTS
Microsoft Bing Text to Speech library for Python
Language:Python
smallflyingpig/python_nlp
text processing with python
Language:Jupyter Notebook
smallflyingpig/pytorch-CycleGAN-and-pix2pix
Image-to-image translation in PyTorch (e.g., horse2zebra, edges2cats, and more)
Language:Python
smallflyingpig/pytorch-fid
A Port of Fréchet Inception Distance (FID score) to PyTorch
Language:Python
smallflyingpig/senet.pytorch
PyTorch implementation of SENet
Language:Python3 0
smallflyingpig/StackGAN
Language:Python
smallflyingpig/StackGAN-v2
Language:Python
smallflyingpig/tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
Language:Jupyter Notebook3 0
smallflyingpig/torchdiffeq
Differentiable ODE solvers with full GPU support and O(1)-memory backpropagation.
Language:Python
smallflyingpig/tutorials
Caffe2 Tutorials
Language:Python
smallflyingpig/VIG
Dataset for Visually Indicated Sound Generation by Perceptually Optimized Classification
Language:Python
smallflyingpig/waveglow
A Flow-based Generative Network for Speech Synthesis
Language:Python3 0
smallflyingpig/wavenet_vocoder
WaveNet vocoder
Language:Python3 0