catherine-qian

catherine-qian's Stars

labmlai/annotated_deep_learning_paper_implementations
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
Language: Python 53.9k 446 1315.6k
google-research/google-research
Google Research
Language: Jupyter Notebook 33.8k 751 1.2k 7.8k
amusi/CVPR2023-Papers-with-Code
A collection of CVPR 2023 papers and open-source projects
14.6k 280 1832.4k
AntixK/PyTorch-VAE
A Collection of Variational Autoencoders (VAE) in PyTorch.
Language: Python 6.5k 44 851.1k
yenchenlin/nerf-pytorch
A PyTorch implementation of NeRF (Neural Radiance Fields) that reproduces the results.
Language: Python 5.4k 53 1141k
TachibanaYoshino/AnimeGANv2
[Open Source]. The improved version of AnimeGAN. Landscape photos/videos to anime
Language: Python 5.1k 60 62710
TachibanaYoshino/AnimeGAN
A TensorFlow implementation of AnimeGAN for fast photo animation. This is the open-source code for the paper "AnimeGAN: a novel lightweight GAN for photo animation", which uses the GAN framework to transform real-world photos into anime images.
Language: Python 4.5k 102 55659
pengzhiliang/MAE-pytorch
Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners
Language: Python 2.6k 24 96344
TorchSSL/TorchSSL
A PyTorch-based library for semi-supervised learning (NeurIPS'21)
Language: Python 1.3k 14 63187
YudongGuo/AD-NeRF
This repository contains a PyTorch implementation of "AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis".
Language: Python 1k 16 138172
TaoRuijie/ECAPA-TDNN
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when trained only on Vox2)
Language: Python 586 4 80113
descriptinc/lyrebird-wav2clip
Official implementation of the paper WAV2CLIP: LEARNING ROBUST AUDIO REPRESENTATIONS FROM CLIP
Language: Python 323 11 1327
y0ast/VAE-Torch
Implementation of Variational Auto-Encoder in Torch7
Language: Lua 267 18 1562
danmic/av-se
Deep-Learning-Based Audio-Visual Speech Enhancement and Separation
199 12 222
limbo0000/InstanceLoc
[CVPR 2021] Instance Localization for Self-supervised Detection Pretraining
Language: Python 143 9 2212
julianstastny/VAE-ResNet18-PyTorch
A Variational Autoencoder based on the ResNet18-architecture
Language: Python 113 2 324
MarSaKi/NvEM
[ACM MM 2021 Oral] Official repo of "Neighbor-view Enhanced Model for Vision and Language Navigation"
Language: C++ 77 1 12
yinkalario/EIN-SELD
An Improved Event-Independent Network for Polyphonic Sound Event Localization and Detection
Language: Python 64 4 1415
lilianemomeni/KWS-Net
Seeing Wake Words: Audio-visual Keyword Spotting
Language: Python 62 4 512
robot-learning-freiburg/MM-DistillNet
PyTorch code for training MM-DistillNet for multimodal knowledge distillation. http://rl.uni-freiburg.de/research/multimodal-distill
Language: Python 59 8 2114
krantiparida/beyond-image-to-depth
Language: Python 38 6 1213
jiamings/ml-cpc
Language: Jupyter Notebook 36 2 14
skelemoa/tal-hmo
Fusional approaches for temporal action localization in untrimmed videos
Language: Python 35 2 47
facebookresearch/VisualEchoes
VisualEchoes Dataset (ECCV 2020)
Language: Python 34 6 73
chahuja/aisle
Official Repository for the paper "No Gestures Left Behind: Learning Relationships between Spoken Language and Freeform Gestures", Findings at EMNLP 2020
Language: Python 20 2 14
nishantrai18/cocon
CoCon: Cooperative Contrastive Learning
Language: Python 20 2 17
thomeou/General-network-architecture-for-sound-event-localization-and-detection
This repository consists of python code to train sound event localization and detection models.
Language: Python 15 2 44
kampta/PatchVAE
PyTorch implementation of "PatchVAE: Learning Local Latent Codes for Recognition" to appear in CVPR 2020
Language: Python 13 4 03
juanmavera/ADENET
Code used in the paper "Towards End-to-End Acoustic Localization using Deep Learning: from Audio Signal to Source Position Coordinates"
Language: Python 6 0 11
yuhanghe01/OpenSound
Various Audio Process Baselines
6 1 00