masked-autoencoder

There are 77 repositories under masked-autoencoder topic.

  • OpenGVLab/InternVideo

    [ECCV2024] Video Foundation Models & Data for Multimodal Understanding

    Language:Python2k27277126
  • MCG-NJU/VideoMAE

    [NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

    Language:Python1.6k15129154
  • SparK

    keyu-tian/SparK

    [ICLR'23 SpotlightšŸ”„] The first successful BERT/MAE-style pretraining on any convolutional network; Pytorch impl. of "Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling"

    Language:Python1.4k209085
  • EdisonLeeeee/Awesome-Masked-Autoencoders

    A collection of literature after or concurrent with Masked Autoencoder (MAE) (Kaiming He el al.).

  • Lupin1998/Awesome-MIM

    [Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897)

    Language:Python3437617
  • implus/UM-MAE

    Official Codes for "Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality"

    Language:Jupyter Notebook24442220
  • uncbiag/SimpleClick

    SimpleClick: Interactive Image Segmentation with Simple Vision Transformers (ICCV 2023)

    Language:Python24453341
  • implus/mae_segmentation

    reproduction of semantic segmentation using masked autoencoder (mae)

    Language:Python1653714
  • xyzforever/BEVT

    PyTorch implementation of BEVT (CVPR 2022) https://arxiv.org/abs/2112.01529

    Language:Python15971019
  • ruiwang2021/mvd

    [CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv.org/abs/2212.04500)

    Language:Python12451011
  • TonyLianLong/CrossMAE

    Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders

    Language:Python118357
  • NeRF-MAE

    zubair-irshad/NeRF-MAE

    [ECCV 2024] Pytorch code for our ECCV'24 paper NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance Fields

    Language:Python104834
  • Haochen-Wang409/HPM

    [CVPR'23 & TPAMI'25] Hard Patches Mining for Masked Image Modeling & Bootstrap Masked Visual Modeling via Hard Patch Mining

    Language:Python1022127
  • habla-liaa/encodecmae

    Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'

    Language:Python99425
  • nttcslab/msm-mae

    Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations

    Language:Jupyter Notebook94768
  • nttcslab/m2d

    Masked Modeling Duo: Towards a Universal Audio Pre-training Framework

    Language:Jupyter Notebook92474
  • rishikksh20/AudioMAE-pytorch

    Unofficial PyTorch implementation of Masked Autoencoders that Listen

    Language:Python69236
  • MCG-NJU/VideoMAE-Action-Detection

    [NeurIPS 2022 Spotlight] VideoMAE for Action Detection

    Language:Python67267
  • HKUDS/MAERec

    [SIGIR'2023] "MAERec: Graph Masked Autoencoder for Sequential Recommendation"

    Language:Python615135
  • recursionpharma/maes_microscopy

    Official repo for Recursion's accepted spotlight paper at NeurIPS 2023 Generative AI & Biology workshop.

    Language:Jupyter Notebook586612
  • lucidrains/LVMAE-pytorch

    Implementation of the proposed LVMAE, from the paper, Extending Video Masked Autoencoders to 128 frames, in Pytorch

    Language:Python49152
  • naver-ai/augsub

    [CVPR 2025] Official PyTorch implementation of MaskSub "Masking meets Supervision: A Strong Learning Alliance"

    Language:Python45201
  • lyhkevin/MT-Net

    Multi-scale Transformer Network for Cross-Modality MR Image Synthesis (IEEE TMI)

    Language:Python36192
  • stoneMo/DeepAVFusion

    Official codebase for "Unveiling the Power of Audio-Visual Early Fusion Transformers with Dense Interactions through Masked Modeling".

    Language:Python33562
  • Event-AHU/VFM-Det

    VFM-Det: Towards High-Performance Vehicle Detection via Large Foundation Models

    Language:Python31133
  • shlokk/mae-contrastive

    Official implementation of "A simple, efficient and scalable contrastive masked autoencoder for learning visual representations".

    Language:Python31453
  • samsad35/VQ-MAE-S-code

    A Vector Quantized Masked AutoEncoder for speech emotion recognition

    Language:Python28181
  • Westlake-AI/A2MIM

    [ICML 2023] Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN

    Language:Python28163
  • jakhac/CSMAE

    Cross-Sensor Masked Autoencoder for Content Based Image Retrieval in Remote Sensing

    Language:Python26124
  • sunilhoho/EVEREST

    Official Pytorch implementation of EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal Tokens [ICML2024].

    Language:Python26351
  • liruiw/Dec-SSL

    Understanding Self-Supervised Learning in a non-IID Setting

    Language:Python20221
  • russellllaputa/MIRL

    [NeurIPS 2023] Masked Image Residual Learning for Scaling Deeper Vision Transformers

    Language:Python19303
  • Ryan21wy/HSIMAE

    HSIMAE: A Unified Masked Autoencoder with large-scale pretraining for Hyperspectral Image Classification

    Language:Python19230
  • bayartsogt-ya/albert-mongolian

    ALBERT trained on Mongolian text corpus

    Language:Jupyter Notebook18223
  • mvrl/BirdSAT

    A PyTorch implementation of "BirdSAT: Cross-View Contrastive Masked Autoencoders for Bird Species Classification and Mapping"

    Language:Python18104
  • yifanzhang-pro/M-MAE

    Official implementation of Matrix Variational Masked Autoencoder (M-MAE) for ICML paper "Information Flow in Self-Supervised Learning" (https://arxiv.org/abs/2309.17281)

    Language:Python14113