/MAE-pytorch

Primary LanguageJupyter Notebook

MAE PyTorch

Simple implementation of MAE, mask out some patches in input image, send patches not masked into a transformer encoder, cat encoder output with mask tokens into transformer decoder to reconstruct original image.

Result

model trained with mask_ratio=0.75, infer in different mask ratio.

  • mask_ratio = 0.5 img

  • mask_ratio = 0.75 img

  • mask_ratio = 0.90 img

  • mask_ratio = 0.95 img

TODO

  • train a vit on this dataset, and pretrain MAE on it, varify whether MAE pretrained can improve acc