Pinned Repositories
-Audio-visualization
使用快速傅里叶变换(FFT)实现的音频文件的可视化
111
2.5D-Visual-Sound
2.5D visual sound
3D-ResNets-PyTorch
3D ResNets for Action Recognition (CVPR 2018)
academic-resume
acoustic-images-self-supervision
Code for the paper "Leveraging Acoustic Images for Effective Self-Supervised Audio Representation Learning" ECCV 2020
Adaptive
Pytorch Implementation of Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning
mvda
Discriminant Analysis Algorithms
Partial-Multi-Label-Learning
A curated list of resources for Partial-Multi-Label-Learning
solo-learn
solo-learn: a library of self-supervised methods for visual representation learning powered by Pytorch Lightning
xuanhanyu's Repositories
xuanhanyu/acoustic-images-self-supervision
Code for the paper "Leveraging Acoustic Images for Effective Self-Supervised Audio Representation Learning" ECCV 2020
xuanhanyu/Partial-Multi-Label-Learning
A curated list of resources for Partial-Multi-Label-Learning
xuanhanyu/solo-learn
solo-learn: a library of self-supervised methods for visual representation learning powered by Pytorch Lightning
xuanhanyu/-Audio-visualization
使用快速傅里叶变换(FFT)实现的音频文件的可视化
xuanhanyu/111
xuanhanyu/academic-resume
xuanhanyu/AudioDVP
AudioDVP:Photorealistic Audio-driven Video Portraits
xuanhanyu/AVID-CMA
Audio Visual Instance Discrimination with Cross-Modal Agreement
xuanhanyu/AVVP-ECCV20
Unified Multisensory Perception: Weakly-Supervised Audio-Visual Video Parsing, ECCV, 2020. (Spotlight)
xuanhanyu/awesome-audio-visual
A curated list of different papers and datasets in various areas of audio-visual processing
xuanhanyu/awesome-computer-vision
A curated list of awesome computer vision resources
xuanhanyu/CM-ACC
Cross-model active contrastive coding
xuanhanyu/deep_avsr
A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.
xuanhanyu/DeepClustering
Methods and Implements of Deep Clustering
xuanhanyu/KWS-Net
Seeing Wake Words: Audio-visual Keyword Spotting
xuanhanyu/Localizing-Visual-Sounds-the-Hard-Way
Localizing Visual Sounds the Hard Way
xuanhanyu/Mead
MEAD: A Large-scale Audio-visual Dataset for Emotional Talking-face Generation [ECCV2020]
xuanhanyu/moco
PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722
xuanhanyu/OpenSelfSup
Self-Supervised Learning Toolbox and Benchmark
xuanhanyu/PiCO
PyTorch implementation of PiCO https://arxiv.org/abs/2201.08984
xuanhanyu/PSOL
code repository of “Rethinking the Route Towards Weakly Supervised Object Localization” in CVPR 2020
xuanhanyu/PSP_CVPR_2021
A PyTorch implementation of the CVPR-2021 paper: Positive Sample Propagation along the Audio-Visual Event Line.
xuanhanyu/Separating-Sounds-from-a-Single-Image
PyTorch implementation of "Separating Sounds from a Single Image"
xuanhanyu/sound-spaces
A first-of-its-kind acoustic simulation platform for audio-visual embodied AI research. It supports training and evaluating multiple tasks and applications.
xuanhanyu/starter-hugo-academic
xuanhanyu/starter-hugo-portfolio-theme
xuanhanyu/Talking-Face_PC-AVS
Code for Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation (CVPR 2021)
xuanhanyu/VisualVoice
Audio-Visual Speech Separation with Cross-Modal Consistency
xuanhanyu/WAL
xuanhanyu/xuanhanyu.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes