Pinned Repositories
ae-w2v-attention
AFWDataset
Annotated Faces in the Wild Dataset with modified labels for my bachelor thesis
arctic-captions
audio-classifier
Classify sounds using YouTube-8M and VGGish models
audioset
Fetch and use Google's AudioSet dataset
audioset_tagging_cnn
autovc
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
KAMA_AC
Postgraduate-entrance
北大信工考研资料整理及经验贴
zjy.github.io
Vancause's Repositories
Vancause/KAMA_AC
Vancause/ae-w2v-attention
Vancause/audio-classifier
Classify sounds using YouTube-8M and VGGish models
Vancause/audioset_tagging_cnn
Vancause/CDur
Repository for the paper "Towards duration robust weakly supervised sound event detection"
Vancause/clip-event
Vancause/zjy.github.io
Vancause/coala
COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations
Vancause/crank
A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder
Vancause/DCASE2021-Task1b
Audio-Visual Classifier in Acoustic Scene Clasification
Vancause/DCASE2021_task6_v2
Code for CVSSP submission to DCASE 2021 Task 6
Vancause/dcase_2020_T6
2nd place solution for 2020 DCASE challenge task 6 audio captioning. http://dcase.community/challenge2020/task-automatic-audio-captioning-results#wuyusong2020_t6
Vancause/deepsvg
[NeurIPS 2020] Official code for the paper "DeepSVG: A Hierarchical Generative Network for Vector Graphics Animation". Includes a PyTorch library for deep learning with SVG data.
Vancause/DeepXi
Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.
Vancause/dual_encoding
[CVPR2019] Dual Encoding for Zero-Example Video Retrieval
Vancause/FeatureCut_Y
Vancause/FullSubNet
PyTorch implementation of "A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
Vancause/HAKE-Action-Torch
HAKE-Action in PyTorch
Vancause/Meta-DETR
Meta-DETR: Official PyTorch Implementation
Vancause/PAGAN
PAGAN: a phase-adapted GAN for speech enhancement
Vancause/ppg-vc
PPG-Based Voice Conversion
Vancause/PyTorch-VAE
A Collection of Variational Autoencoders (VAE) in PyTorch.
Vancause/Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Vancause/retrieval-augmentation-nn
Generalization of deep neural networks by using the information of nearest training examples
Vancause/SCAN
PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018)
Vancause/SD-FSIC
Vancause/SpeechSplit
Unsupervised Speech Decomposition Via Triple Information Bottleneck
Vancause/Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
Vancause/vc_Real-Time-Voice-Cloning
clone Real-Time-Voice-Cloning to test
Vancause/vcc20_baseline_cyclevae
Voice Conversion Challenge 2020 CycleVAE baseline system