catherine-qian's Stars
labmlai/annotated_deep_learning_paper_implementations
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
google-research/google-research
Google Research
amusi/CVPR2023-Papers-with-Code
CVPR 2023 论文和开源项目合集
AntixK/PyTorch-VAE
A Collection of Variational Autoencoders (VAE) in PyTorch.
yenchenlin/nerf-pytorch
A PyTorch implementation of NeRF (Neural Radiance Fields) that reproduces the results.
TachibanaYoshino/AnimeGANv2
[Open Source]. The improved version of AnimeGAN. Landscape photos/videos to anime
TachibanaYoshino/AnimeGAN
A Tensorflow implementation of AnimeGAN for fast photo animation ! This is the Open source of the paper 「AnimeGAN: a novel lightweight GAN for photo animation」, which uses the GAN framwork to transform real-world photos into anime images.
pengzhiliang/MAE-pytorch
Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners
TorchSSL/TorchSSL
A PyTorch-based library for semi-supervised learning (NeurIPS'21)
YudongGuo/AD-NeRF
This repository contains a PyTorch implementation of "AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis".
TaoRuijie/ECAPA-TDNN
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)
descriptinc/lyrebird-wav2clip
Official implementation of the paper WAV2CLIP: LEARNING ROBUST AUDIO REPRESENTATIONS FROM CLIP
y0ast/VAE-Torch
Implementation of Variational Auto-Encoder in Torch7
danmic/av-se
Deep-Learning-Based Audio-Visual Speech Enhancement and Separation
limbo0000/InstanceLoc
[CVPR 2021] Instance Localization for Self-supervised Detection Pretraining
julianstastny/VAE-ResNet18-PyTorch
A Variational Autoencoder based on the ResNet18-architecture
MarSaKi/NvEM
[ACM MM 2021 Oral] Official repo of "Neighbor-view Enhanced Model for Vision and Language Navigation"
yinkalario/EIN-SELD
An Improved Event-Independent Network for Polyphonic Sound Event Localization and Detection
lilianemomeni/KWS-Net
Seeing Wake Words: Audio-visual Keyword Spotting
robot-learning-freiburg/MM-DistillNet
PyTorch code for training MM-DistillNet for multimodal knowledge distillation. http://rl.uni-freiburg.de/research/multimodal-distill
krantiparida/beyond-image-to-depth
jiamings/ml-cpc
skelemoa/tal-hmo
Fusional approaches for temporal action localization in untrimmed videos
facebookresearch/VisualEchoes
VisualEchoes Dataset (ECCV 2020)
chahuja/aisle
Official Repository for the paper "No Gestures Left Behind: Learning Relationships between Spoken Language and Freeform Gestures", Findings at EMNLP 2020
nishantrai18/cocon
CoCon: Cooperative Contrastive Learning
thomeou/General-network-architecture-for-sound-event-localization-and-detection
This repository consists of python code to train sound event localization and detection models.
kampta/PatchVAE
PyTorch implementation of "PatchVAE: Learning Local Latent Codes for Recognition" to appear in CVPR 2020
juanmavera/ADENET
Code used in the paper "Towards End-to-End Acoustic Localization using Deep Learning: from Audio Signal to Source Position Coordinates"
yuhanghe01/OpenSound
Various Audio Process Baselines