masked-autoencoder

There are 67 repositories under the masked-autoencoder topic.

  • OpenGVLab/InternVideo

    [ECCV2024] Video Foundation Models & Data for Multimodal Understanding

    Language:Python1.5k2720194
  • keyu-tian/SparK

    [ICLR'23 Spotlight🔥] The first successful BERT/MAE-style pretraining on any convolutional network; Pytorch impl. of "Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling"

    Language:Python1.5k278987
  • MCG-NJU/VideoMAE

    [NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

    Language:Python1.4k16125137
  • EdisonLeeeee/Awesome-Masked-Autoencoders

    A collection of literature after or concurrent with Masked Autoencoder (MAE) (Kaiming He et al.).

  • Lupin1998/Awesome-MIM

    [Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897)

    Language:Python3167616
  • implus/UM-MAE

    Official Codes for "Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality"

    Language:Jupyter Notebook24252220
  • uncbiag/SimpleClick

    SimpleClick: Interactive Image Segmentation with Simple Vision Transformers (ICCV 2023)

    Language:Python21763133
  • implus/mae_segmentation

    reproduction of semantic segmentation using masked autoencoder (mae)

    Language:Python1603714
  • xyzforever/BEVT

    PyTorch implementation of BEVT (CVPR 2022) https://arxiv.org/abs/2112.01529

    Language:Python15871019
  • ruiwang2021/mvd

    [CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv.org/abs/2212.04500)

    Language:Python11561011
  • TonyLianLong/CrossMAE

    Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders

    Language:Python99455
  • habla-liaa/encodecmae

    Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'

    Language:Python91424
  • zubair-irshad/NeRF-MAE

    [ECCV 2024] Pytorch code for our ECCV'24 paper NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Radiance Fields

    Language:Python90824
  • nttcslab/msm-mae

    Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations

    Language:Jupyter Notebook89768
  • Haochen-Wang409/HPM

    [CVPR'23] Hard Patches Mining for Masked Image Modeling

    Language:Python883117
  • nttcslab/m2d

    Masked Modeling Duo: Towards a Universal Audio Pre-training Framework

    Language:Jupyter Notebook81463
  • rishikksh20/AudioMAE-pytorch

    Unofficial PyTorch implementation of Masked Autoencoders that Listen

    Language:Python65336
  • HKUDS/MAERec

    [SIGIR'2023] "MAERec: Graph Masked Autoencoder for Sequential Recommendation"

    Language:Python625135
  • MCG-NJU/VideoMAE-Action-Detection

    [NeurIPS 2022 Spotlight] VideoMAE for Action Detection

    Language:Python55265
  • recursionpharma/maes_microscopy

    Official repo for Recursion's accepted spotlight paper at NeurIPS 2023 Generative AI & Biology workshop.

    Language:Jupyter Notebook467410
  • lyhkevin/MT-Net

    Multi-scale Transformer Network for Cross-Modality MR Image Synthesis (IEEE TMI)

    Language:Python36192
  • shlokk/mae-contrastive

    Official implementation of "A simple, efficient and scalable contrastive masked autoencoder for learning visual representations".

    Language:Python29453
  • stoneMo/DeepAVFusion

    Official codebase for "Unveiling the Power of Audio-Visual Early Fusion Transformers with Dense Interactions through Masked Modeling".

    Language:Python27662
  • Westlake-AI/A2MIM

    [ICML 2023] Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN

    Language:Python25264
  • sunilhoho/EVEREST

    Official Pytorch implementation of EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal Tokens [ICML2024].

    Language:Python24351
  • samsad35/VQ-MAE-S-code

    A Vector Quantized Masked AutoEncoder for speech emotion recognition

    Language:Python21271
  • liruiw/Dec-SSL

    Understanding Self-Supervised Learning in a non-IID Setting

    Language:Python20221
  • russellllaputa/MIRL

    [NeurIPS 2023] Masked Image Residual Learning for Scaling Deeper Vision Transformers

    Language:Python19303
  • bayartsogt-ya/albert-mongolian

    ALBERT trained on Mongolian text corpus

    Language:Jupyter Notebook18222
  • jakhac/CSMAE

    Cross-Sensor Masked Autoencoder for Content Based Image Retrieval in Remote Sensing

    Language:Python18224
  • mvrl/BirdSAT

    A PyTorch implementation of "BirdSAT: Cross-View Contrastive Masked Autoencoders for Bird Species Classification and Mapping"

    Language:Python17104
  • yifanzhang-pro/M-MAE

    Official implementation of Matrix Variational Masked Autoencoder (M-MAE) for ICML paper "Information Flow in Self-Supervised Learning" (https://arxiv.org/abs/2309.17281)

    Language:Python14213
  • waldo-vision/models

    Repository for model development and training

    Language:Python13594
  • naver-ai/lut

    [ECCV 2024] Official PyTorch implementation of LUT "Learning with Unmasked Tokens Drives Stronger Vision Learners"

    Language:Python1240
  • JJLi0427/CNN_Masked_Autoencoder

    Design a patches masked autoencoder by CNN

    Language:Python11200
  • Ryan21wy/HSIMAE

    HSIMAE: A Unified Masked Autoencoder with large-scale pretraining for Hyperspectral Image Classification

    Language:Python11330