masked-autoencoder

There are 50 repositories under masked-autoencoder topic.

  • SparK

    keyu-tian/SparK

    [ICLR'23 Spotlight🔥] The first successful BERT/MAE-style pretraining on any convolutional network; Pytorch impl. of "Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling"

    Language:Python1.4k268081
  • MCG-NJU/VideoMAE

    [NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

    Language:Python1.2k16118124
  • OpenGVLab/InternVideo

    Video Foundation Models & Data for Multimodal Understanding

    Language:Python1k2812170
  • EdisonLeeeee/Awesome-Masked-Autoencoders

    A collection of literature after or concurrent with Masked Autoencoder (MAE) (Kaiming He el al.).

  • Lupin1998/Awesome-MIM

    [Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897)

    Language:Python2627614
  • implus/UM-MAE

    Official Codes for "Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality"

    Language:Jupyter Notebook23252220
  • uncbiag/SimpleClick

    SimpleClick: Interactive Image Segmentation with Simple Vision Transformers (ICCV 2023)

    Language:Python19062931
  • xyzforever/BEVT

    PyTorch implementation of BEVT (CVPR 2022) https://arxiv.org/abs/2112.01529

    Language:Python15171018
  • implus/mae_segmentation

    reproduction of semantic segmentation using masked autoencoder (mae)

    Language:Python1473614
  • ruiwang2021/mvd

    [CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv.org/abs/2212.04500)

    Language:Python92699
  • Haochen-Wang409/HPM

    [CVPR'23] Hard Patches Mining for Masked Image Modeling

    Language:Python813116
  • nttcslab/msm-mae

    Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations

    Language:Jupyter Notebook81767
  • TonyLianLong/CrossMAE

    Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders

    Language:Python74435
  • rishikksh20/AudioMAE-pytorch

    Unofficial PyTorch implementation of Masked Autoencoders that Listen

    Language:Python60426
  • HKUDS/MAERec

    [SIGIR'2023] "MAERec: Graph Masked Autoencoder for Sequential Recommendation"

    Language:Python50595
  • habla-liaa/encodecmae

    Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'

    Language:Python48313
  • nttcslab/m2d

    Masked Modeling Duo: Towards a Universal Audio Pre-training Framework

    Language:Jupyter Notebook48351
  • MCG-NJU/VideoMAE-Action-Detection

    [NeurIPS 2022 Spotlight] VideoMAE for Action Detection

    Language:Python46252
  • lyhkevin/MT-Net

    Multi-scale Transformer Network for Cross-Modality MR Image Synthesis (IEEE TMI)

    Language:Python25182
  • shlokk/mae-contrastive

    Official implementation of "A simple, efficient and scalable contrastive masked autoencoder for learning visual representations".

    Language:Python25451
  • liruiw/Dec-SSL

    Understanding Self-Supervised Learning in a Decentralized Setting

    Language:Python19221
  • sunilhoho/VideoMS

    Official Pytorch implementation of Efficient Video Representation Learning via Masked Video Modeling with Motion-centric Token Selection.

    Language:Python19311
  • bayartsogt-ya/albert-mongolian

    ALBERT trained on Mongolian text corpus

    Language:Jupyter Notebook18222
  • recursionpharma/maes_microscopy

    Official repo for Recursion's accepted spotlight paper at NeurIPS 2023 Generative AI & Biology workshop.

    Language:Python18534
  • Westlake-AI/A2MIM

    [ICML 2023] Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN

    Language:Python18232
  • jakhac/CSMAE

    Cross-Sensor Masked Autoencoder for Content Based Image Retrieval in Remote Sensing

    Language:Python16224
  • mvrl/BirdSAT

    A PyTorch implementation of "BirdSAT: Cross-View Contrastive Masked Autoencoders for Bird Species Classification and Mapping"

    Language:Python13102
  • JJLi0427/CNN_Masked_Autoencoder

    Design a patches masked autoencoder by CNN

    Language:Python11200
  • stoneMo/DeepAVFusion

    Official codebase for "Unveiling the Power of Audio-Visual Early Fusion Transformers with Dense Interactions through Masked Modeling".

  • waldo-vision/models

    Repository for model development and training

    Language:Python10594
  • samsad35/VQ-MAE-S-code

    A Vector Quantized Masked AutoEncoder for speech emotion recognition

    Language:Python9271
  • yifanzhang-pro/M-MAE

    Official implementation of Matrix Variational Masked Autoencoder (M-MAE) for ICML paper "Information Flow in Self-Supervised Learning" (https://arxiv.org/abs/2309.17281)

    Language:Python9203
  • jonahanton/SSL_audio

    Codebase for Imperial MSc AI Individual Project - Self-Supervised Learning for Audio Inference

    Language:Python8211
  • Video-MAC/VideoMAC

    Official code for CVPR2024 “VideoMAC: Video Masked Autoencoders Meet ConvNets”

    Language:Python8211
  • Ryan21wy/HSIMAE

    HSIMAE: A Unified Masked Autoencoder with large-scale pretraining for Hyperspectral Image Classification

    Language:Python731
  • ebird_project

    YunghuiHsu/ebird_project

    Extraction of deep features/representation of birds by deep learning algorithms.

    Language:Jupyter Notebook4101