TheSouthFrog's Stars
facebookresearch/mae
PyTorch implementation of MAE https//arxiv.org/abs/2111.06377
DefTruth/lite.ai.toolkit
🛠 A lite C++ toolkit of 100+ Awesome AI models, support ORT, MNN, NCNN, TNN and TensorRT. 🎉🎉
apple/ml-ane-transformers
Reference implementation of the Transformer architecture optimized for Apple Neural Engine (ANE)
xianfei/SysMocap
A real-time motion capture system for 3D virtual character animating.
OFA-Sys/OFA
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
n2cholas/awesome-jax
JAX - A curated list of resources https://github.com/google/jax
MCG-NJU/VideoMAE
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
czczup/ViT-Adapter
[ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions
autonomousvision/stylegan-xl
[SIGGRAPH'22] StyleGAN-XL: Scaling StyleGAN to Large Diverse Datasets
kakaobrain/rq-vae-transformer
The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)
yandex-research/ddpm-segmentation
Label-Efficient Semantic Segmentation with Diffusion Models (ICLR'2022)
lucidrains/rotary-embedding-torch
Implementation of Rotary Embeddings, from the Roformer paper, in Pytorch
omriav/blended-latent-diffusion
Official implementation for "Blended Latent Diffusion" [SIGGRAPH 2023]
Alpha-VL/ConvMAE
ConvMAE: Masked Convolution Meets Masked Autoencoders
google-research/maskgit
Official Jax Implementation of MaskGIT
microsoft/esvit
EsViT: Efficient self-supervised Vision Transformers
FacePerceiver/FaRL
FaRL for Facial Representation Learning [Official, CVPR 2022]
omerbt/Splice
Official Pytorch Implementation for "Splicing ViT Features for Semantic Appearance Transfer" presenting "Splice" (CVPR 2022 Oral)
jiawei-ren/BalancedMSE
[CVPR 2022 Oral] Balanced MSE for Imbalanced Visual Regression https://arxiv.org/abs/2203.16427
dvlab-research/FocalsConv
Focal Sparse Convolutional Networks for 3D Object Detection (CVPR 2022, Oral)
ShoufaChen/AdaptFormer
[NeurIPS 2022] Implementation of "AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition"
sayakpaul/probing-vits
Probing the representations of Vision Transformers.
Rayhane-mamah/Efficient-VDVAE
Official Pytorch and JAX implementation of "Efficient-VDVAE: Less is more"
LayneH/GreenMIM
[NeurIPS2022] Official implementation of the paper 'Green Hierarchical Vision Transformer for Masked Image Modeling'.
BeierZhu/Prompt-align
[ICCV 2023] Prompt-aligned Gradient for Prompt Tuning
colour-science/colour-hdri
HDRI / Radiance image processing algorithms for Python
fenglinglwb/EDT
On Efficient Transformer-Based Image Pre-training for Low-Level Vision
hongfz16/HCMoCo
[CVPR 2022 Oral] Versatile Multi-Modal Pre-Training for Human-Centric Perception
enyac-group/supmae
This is a offical PyTorch/GPU implementation of SupMAE.
VITA-Group/ViT-Anti-Oversmoothing
[ICLR 2022] "Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to Practice" by Peihao Wang, Wenqing Zheng, Tianlong Chen, Zhangyang Wang