lucidrains/vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
Python · MIT license
Issues
LayerNorm for ViT
#244 opened, 5 comments
Neighbourhood Attention Implementation
#243 opened, 1 comment
MAE `decoder_tokens` computation
#241 opened, 2 comments
How to retrain ViT
#240 opened, 2 comments
Question about attention's qkv matrix
#237 opened, 4 comments
Simplify `to_patch_embedding` using Conv2d
#236 opened, 1 comment
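The Conv2d idea in #236 rests on a simple equivalence: a Conv2d whose kernel size equals its stride computes the same thing as slicing the image into non-overlapping patches and applying one shared linear projection to each flattened patch. A pure-Python toy sketch of that equivalence (single input and output channel; all names and sizes here are illustrative, not vit-pytorch's actual API):

```python
def conv_patch_embed(image, patch, weight):
    """image: H x H grid (list of lists); weight: flat list of length patch*patch.
    Returns one scalar embedding per non-overlapping patch, in row-major order."""
    out = []
    for r in range(0, len(image), patch):
        for c in range(0, len(image), patch):
            # Flatten the patch, then apply the shared linear projection --
            # exactly what a stride == kernel_size convolution does per window.
            flat = [image[r + i][c + j] for i in range(patch) for j in range(patch)]
            out.append(sum(v * w for v, w in zip(flat, weight)))
    return out

image = [
    [1, 2, 3, 4],
    [5, 6, 7, 8],
    [9, 10, 11, 12],
    [13, 14, 15, 16],
]
# A weight vector that just picks each patch's top-left pixel:
print(conv_patch_embed(image, 2, [1, 0, 0, 0]))  # [1, 3, 9, 11]
```

In PyTorch terms this corresponds to `nn.Conv2d(channels, dim, kernel_size=patch_size, stride=patch_size)` followed by flattening the spatial grid into a token sequence, versus the library's rearrange-then-`nn.Linear` formulation; both are the same linear map on patches.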
Loading weights of custom ViT models
#234 opened, 1 comment
Visualize the attention weights
#233 opened, 1 comment
Distillation RuntimeError
#232 opened, 0 comments
Attention maps for PiT
#230 opened, 1 comment
Question about example notebook
#229 opened, 1 comment
EfficientFormer Request!
#228 opened, 0 comments
How to calculate Params and FLOPs for ViT?
#225 opened, 2 comments
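For the parameter-counting question in #225, most of a ViT's parameters sit in its Transformer blocks, and a block's count follows from its layer shapes. A back-of-envelope sketch, assuming biased q/k/v and output projections and a 2-layer MLP (vit-pytorch's actual layers may differ, e.g. its qkv projection has no bias):

```python
def transformer_block_params(dim, mlp_dim):
    # q, k, v and output projections: four dim x dim matrices plus biases
    attn = 4 * dim * dim + 4 * dim
    # two Linear layers of the MLP: dim -> mlp_dim -> dim, with biases
    mlp = dim * mlp_dim + mlp_dim + mlp_dim * dim + dim
    # two LayerNorms, each with a weight and bias vector of length dim
    norms = 2 * 2 * dim
    return attn + mlp + norms

# ViT-Base-sized block: dim=768, mlp_dim=3072 -> roughly 7.1M params per block
print(transformer_block_params(768, 3072))
```

For exact numbers, `sum(p.numel() for p in model.parameters())` on the instantiated model is the reliable route; FLOPs additionally depend on sequence length (attention is quadratic in the number of patches), so a profiler such as `fvcore` or `ptflops` is the usual tool there.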
MaxViT's MbConv doesn't match the article
#223 opened, 2 comments
Add another MLP head in the Vision Transformer
#222 opened, 8 comments
Attention maps for rectangular input
#219 opened, 1 comment
Issue: ViT + loss function
#214 opened, 2 comments
Training on my own datasets? train.py, pre.py?
#213 opened, 2 comments
Did you miss dropout?
#209 opened, 2 comments
CCT and non-square images
#208 opened, 0 comments
A new idea
#207 opened, 2 comments
How to train on a custom dataset
#204 opened, 0 comments
How to use Multi-Head Attention in ViT
#201 opened, 0 comments
MAE using a pretrained ViT
#196 opened, 3 comments
How to get the feature map of the ViT encoder
#191 opened, 1 comment
Does vit-pytorch have MViT?
#190 opened, 2 comments
ViT MAE decoder positional embeddings
#189 opened, 0 comments
ViT MAE reconstruction size mismatch
#187 opened, 2 comments
Where is train.py? Thanks
#185 opened, 1 comment
About different patch embedding in ViT
#184 opened