masked-autoencoder

There are 67 repositories under masked-autoencoder topic.

OpenGVLab/InternVideo
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
Language:Python1.5k 27 20194
keyu-tian/SparK
[ICLR'23 Spotlight🔥] The first successful BERT/MAE-style pretraining on any convolutional network; Pytorch impl. of "Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling"
Language:Python1.5k 27 8987
MCG-NJU/VideoMAE
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Language:Python1.4k 16 125137
EdisonLeeeee/Awesome-Masked-Autoencoders
A collection of literature after or concurrent with Masked Autoencoder (MAE) (Kaiming He el al.).
796 32 152
Lupin1998/Awesome-MIM
[Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897)
Language:Python316 7 616
implus/UM-MAE
Official Codes for "Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality"
Language:Jupyter Notebook242 5 2220
uncbiag/SimpleClick
SimpleClick: Interactive Image Segmentation with Simple Vision Transformers (ICCV 2023)
Language:Python217 6 3133
implus/mae_segmentation
reproduction of semantic segmentation using masked autoencoder (mae)
Language:Python160 3 714
xyzforever/BEVT
PyTorch implementation of BEVT (CVPR 2022) https://arxiv.org/abs/2112.01529
Language:Python158 7 1019
ruiwang2021/mvd
[CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv.org/abs/2212.04500)
Language:Python115 6 1011
TonyLianLong/CrossMAE
Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders
Language:Python99 4 55
habla-liaa/encodecmae
Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'
Language:Python91 4 24
zubair-irshad/NeRF-MAE
[ECCV 2024] Pytorch code for our ECCV'24 paper NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance Fields
Language:Python90 8 24
nttcslab/msm-mae
Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations
Language:Jupyter Notebook89 7 68
Haochen-Wang409/HPM
[CVPR'23] Hard Patches Mining for Masked Image Modeling
Language:Python88 3 117
nttcslab/m2d
Masked Modeling Duo: Towards a Universal Audio Pre-training Framework
Language:Jupyter Notebook81 4 63
rishikksh20/AudioMAE-pytorch
Unofficial PyTorch implementation of Masked Autoencoders that Listen
Language:Python65 3 36
HKUDS/MAERec
[SIGIR'2023] "MAERec: Graph Masked Autoencoder for Sequential Recommendation"
Language:Python62 5 135
MCG-NJU/VideoMAE-Action-Detection
[NeurIPS 2022 Spotlight] VideoMAE for Action Detection
Language:Python55 2 65
recursionpharma/maes_microscopy
Official repo for Recursion's accepted spotlight paper at NeurIPS 2023 Generative AI & Biology workshop.
Language:Jupyter Notebook46 7 410
lyhkevin/MT-Net
Multi-scale Transformer Network for Cross-Modality MR Image Synthesis (IEEE TMI)
Language:Python36 1 92
shlokk/mae-contrastive
Official implementation of "A simple, efficient and scalable contrastive masked autoencoder for learning visual representations".
Language:Python29 4 53
stoneMo/DeepAVFusion
Official codebase for "Unveiling the Power of Audio-Visual Early Fusion Transformers with Dense Interactions through Masked Modeling".
Language:Python27 6 62
Westlake-AI/A2MIM
[ICML 2023] Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN
Language:Python25 2 64
sunilhoho/EVEREST
Official Pytorch implementation of EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal Tokens [ICML2024].
Language:Python24 3 51
samsad35/VQ-MAE-S-code
A Vector Quantized Masked AutoEncoder for speech emotion recognition
Language:Python21 2 71
liruiw/Dec-SSL
Understanding Self-Supervised Learning in a non-IID Setting
Language:Python20 2 21
russellllaputa/MIRL
[NeurIPS 2023] Masked Image Residual Learning for Scaling Deeper Vision Transformers
Language:Python19 3 03
bayartsogt-ya/albert-mongolian
ALBERT trained on Mongolian text corpus
Language:Jupyter Notebook18 2 22
jakhac/CSMAE
Cross-Sensor Masked Autoencoder for Content Based Image Retrieval in Remote Sensing
Language:Python18 2 24
mvrl/BirdSAT
A PyTorch implementation of "BirdSAT: Cross-View Contrastive Masked Autoencoders for Bird Species Classification and Mapping"
Language:Python17 1 04
yifanzhang-pro/M-MAE
Official implementation of Matrix Variational Masked Autoencoder (M-MAE) for ICML paper "Information Flow in Self-Supervised Learning" (https://arxiv.org/abs/2309.17281)
Language:Python14 2 13
waldo-vision/models
Repository for model development and training
Language:Python13 5 94
naver-ai/lut
[ECCV 2024] Official PyTorch implementation of LUT "Learning with Unmasked Tokens Drives Stronger Vision Learners"
Language:Python12 4 0
JJLi0427/CNN_Masked_Autoencoder
Design a patches masked autoencoder by CNN
Language:Python11 2 00
Ryan21wy/HSIMAE
HSIMAE: A Unified Masked Autoencoder with large-scale pretraining for Hyperspectral Image Classification
Language:Python11 3 30

masked-autoencoder

OpenGVLab/InternVideo

keyu-tian/SparK

MCG-NJU/VideoMAE

EdisonLeeeee/Awesome-Masked-Autoencoders

Lupin1998/Awesome-MIM

implus/UM-MAE

uncbiag/SimpleClick

implus/mae_segmentation

xyzforever/BEVT

ruiwang2021/mvd

TonyLianLong/CrossMAE

habla-liaa/encodecmae

zubair-irshad/NeRF-MAE

nttcslab/msm-mae

Haochen-Wang409/HPM

nttcslab/m2d

rishikksh20/AudioMAE-pytorch

HKUDS/MAERec

MCG-NJU/VideoMAE-Action-Detection

recursionpharma/maes_microscopy

lyhkevin/MT-Net

shlokk/mae-contrastive

stoneMo/DeepAVFusion

Westlake-AI/A2MIM

sunilhoho/EVEREST

samsad35/VQ-MAE-S-code

liruiw/Dec-SSL

russellllaputa/MIRL

bayartsogt-ya/albert-mongolian

jakhac/CSMAE

mvrl/BirdSAT

yifanzhang-pro/M-MAE

waldo-vision/models

naver-ai/lut

JJLi0427/CNN_Masked_Autoencoder

Ryan21wy/HSIMAE